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^ (54) TiUe: PLANTS HAVING MODIFIED GROWTH CHARACTERISTICS AND A METHOD FOR MAKING THE SAME 

O (57) Abstract: The present invention concerns a method for modifying the growth characteristics of plants by modifying expression 
^ in a plant of a nucleic acid sequence encoding a 2xC2H2 zinc finger protein and/or modifying level and/or activity in a plant of a 
2xC2H2 zinc finger protein. The invention also relates to transgenic plants having modified growth characteristics, which plants 
have modified expression of a nucleic acid encoding a 2xC2H2 zinc finger protein. For example yield of crop plants are improved 
by the methods of the present invention. 
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Plants having modified growth characteristics and a method for 

making the same 

The present invention concerns a method for modifying plant growth characteristics. More 
specifically, the present invention concerns a method for modifying the growth characteristics 
of a plant by modifying expression of a nucleic acid encoding a zinc finger protein and/or by 
modifying the level and/or activity of a zinc finger protein in a plant, which zinc finger protein 
has two zinc finger domains of the type C2H2 (2xC2H2). The present invention also concerns 
plants having modified expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modified levels and/or activity of a 2xC2H2 zinc finger protein, which plants have 
modified growth characteristics relative to corresponding wild type plants. 

Given the ever-increasing world population, it remains a major goal of agricultural research to 
improve the efficiency of agriculture. Conventional means for crop and horticultural 
improvements utilise selective breeding techniques to identify plants having desirable 
characteristics. However, such selective breeding techniques have several drawbacks, namely 
that these techniques are typically labour intensive and result in plants that often contain 
heterogeneous genetic components that may not always result in the desirable trait being 
passed on from parent plants. Advances in molecular biology have allowed mankind to modify 
the germplasm of animals and plants in a specific and controlled way. Genetic engineering of 
plants entails the isolation and manipulation of genetic material (typically in the form of DNA or 
RNA) and the subsequent introduction of that genetic material into a plant. Such technology 
has led to the development of plants having various improved economic, agronomic or 
horticultural traits. A trait or growth characteristic of particular economic interest is high yield. 
Yield is normally defined as the measurable produce of economic value from a crop. This may 
be defined in terms of quantity and/or quality. Other important growth characteristics include 
modified architecture, modified growth rate, among others. 

The ability to influence one or more of the abovementioned growth characteristics, would have 
many applications in areas such as crop enhancement, plant breeding, production of 
ornamental plants, arboriculture, horticulture, forestry, production, of algae or plants (for 
example for use as bioreactors, for the production of substances such as pharmaceuticals, 
antibodies, or vaccines, or for the bioconversion of organic waste or for use as fuel in the case 
of high-yielding algae and plants). 
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The term "zinc finger* describes a nucleic acid-binding domain in a protein that Is folded 
around a tetrahedrally coordinated Zinc ion (Miller et al. 1985. EMBO, 4, 1609-1614). The 
amino acids that coordinate the zinc ion, are always cystein or histidine residues, however, 
diversity occurs in the sequence and length of the zinc finger domain. Zinc finger proteins may 

5 contain several zinc finger domains of the same or different type. Further variability is 
encountered in nature by association of zinc finger domains with other domains. For example, 
some zinc finger proteins are found in association with ring finger or coil-coil domains, to form 
a so-called tripartite domain. There are several types of zinc fingers, such as C2H2, C2HC, 
C2C2. C2H2 is known as the classical zinc finger domain. There are typically two criteria used 

10 to classify zinc finger proteins, the first being the type of zinc finger and the second being the 
number of zinc fingers present in the protein. Zinc finger proteins having a single C2H2 domain 
have been characterised, for example Superman from Arabidopsis and Ramosa I from maize. 
A well-characterised zinc finger protein having three C2H2 domains is the Indeterminate 1 
protein from Maize. Although the first report of this gene (Colasanti et al., Cell. 1998 May 

15 15;93(4):593-603) only mentions the presence of two zinc finger domains, a more 
sophisticated analysis, using pFAM domain search, revealed the presence of three C2H2 zinc 
finger domains. Also known are zinofinger proteins having only two C2H2 domains, for 
example ZAT10 (STZ )and SCOF-1. This subset of plant zinc finger proteins having two C2H2 
domains have been implicated in plant responses to various stresses (Sakamoto et a/., Gene 

20 248 (1-2) 23-32 (2000)). Both STZ and SCOF-1 have been used to enhance abiotic stress 
tolerance. When over-expressed, STZ has been reported to increase salt tolerance in yeast 
(Uppuner et a/., J Biol Chem. 271 (22) 12859-12866 (1996)) and over-expression of the 
SCOF-1 gene under control of the CaMV 35 S promoter has been reported to enhance cold 
tolerance in Arabidopsis thaliana (Kim et a!., Plant J. 25 (3) 247-259 (2001)). Reports of plants 

25 having modified expression of a zinc finger encoding gene (whether the zinc finger gene is 
mutated, over-expressed or otherwise) describe plants having abnormal growth 
characteristics, none of which (with the exception of cold stress tolerance in transgenic plants 
expressing SCOF-1) are desirable for crops or describe effects that are only detectable under 
particular stress conditions. 

30 

It has now been found that modifying expression in a plant of a 2xC2H2 zinc finger gene 
and/or modifying the level and/or activity in a plant of a 2xC2H2 zinc finger protein gives plants 
having modified growth characteristics. In particular it has been found that introduction into a 
plant of a 2xC2H2 zinc finger nucleic acid gives plants modified growth characteristics, such as 
35 increased yield, modified leaf architecture and altered cycle time, each relative to wild type 
plants. 
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Therefore according to one embodiment of the present invention there is provided a method 
for modifying the growth characteristics of a plant, comprising modifying expression in a plant 
of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying level and/or activity in 
a plant of a 2xC2H2 zinc finger protein. 

5 

The term "modifying" as used herein is taken to mean enhancing, decreasing and/or changing 
in place and/or time. Modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger 
protein or modifying the level and/or activity of the 2xC2H2 zinc finger protein itself 
encompasses altered expression of a gene and/or altered level and/or activity of a gene 

10 product, namely a polypeptide, in specific cells or tissues, when compared to expression, level 
and/or activity of a 2xC2H2 zinc finger gene or protein in corresponding wild-type plants. The 
modified gene expression may result from modified expression of an endogenous 2xC2H2 zinc 
finger gene and/or may result from modified expression of a 2xC2H2 zinc finger gene 
previously introduced into a plant. Similarly, modified levels and/or activity of a 2xC2H2 zinc 

15 finger protein may be due to modified expression of an endogenous 2xC2H2 zinc finger 
nucleic acid/gene and/or due to modified expression of a 2xC2H2 zinc finger nucleic acid/gene 
previously introduced into a plant Modified expression of a gene/nucleic acid and/or modified 
level and/or activity of a gene product/protein may be effected, for example, by chemical 
means and/or recombinant means. 

20 

Therefore there is provided by the present invention, a method for modifying the growth 
characteristics of a plant, comprising modifying expression, level and/or activity of a 2xC2H2 
zinc finger gene or protein by recombinant means and/or by chemical means. 

25 Advantageously, modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein 
and/or modifying level and/or activity of the 2xC2H2 zinc finger protein itself may be effected 
by chemical means, i.e. by exogenous application of one or more compounds or elements 
capable of modifying activity of the 2xC2H2 zinc finger protein and/or capable of modifying 
expression of a 2xC2H2 zinc finger gene (which may be either an endogenous gene or a 

30 transgene introduced into a plant). The term "exogenous application 0 as defined herein is 
taken to mean the contacting or administering of a suitable compound or element to a plant 
The compound or element may be exogenously applied to a plant in a form suitable for plant 
uptake (such as through application to the soil for uptake via the roots, or in the case of some 
plants by applying directly to the leaves, for example by spraying). The exogenous application 

35 may take place on wild-type plants or on transgenic plants that have previously been 
transformed with a 2xC2H2 zinc finger nucleic acid/gene or other transgene. 
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Suitable compounds or elements for exogenous application Include 2xC2H2 zinc finger 
proteins or 2xC2H2 zinc finger nucleic acids. Alternatively, exogenous application of 
compounds or elements capable of modifying levels of factors that directly or indirectly activate 
or inactivate a 2xC2H2 zinc finger protein will also be suitable in practising the invention. Also 
5 included are antibodies that can recognise or mimic the function of 2xC2H2 zinc finger 
proteins. Such antibodies may comprise "plantibodies", single chain antibodies, IgG antibodies 
and heavy chain camel antibodies, as well as fragments thereof. 

Additionally or alternatively, the resultant effect may also be achieved by the exogenous 
10 application of an Interacting protein or activator or an inhibitor of a 2xC2H2 zinc finger 
gene/gene product. Additionally or alternatively, the compound or element may be a mutagenic 
substance, such as a chemical selected from any one or more of. N-nitroso-N-ethylurea, 
ethylene imine, ethyl methanesulphonate and diethyl sulphate. Mutagenesis may also be 
achieved by exposure to ionising radiation, sue* as X-rays or gamma-rays or ultraviolet light. 
15 Methods for introducing mutations and for testing the effect of mutations (such as by 
monitoring gene expression and/or protein activity) are well known in the art. 

Additionally or alternatively, and according to a preferred embodiment of the present invention, 
modifying expression of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or modifying 
20 level and/or activity of the 2xC2H2 zinc finger protein may be effected by recombinant means. 
Such recombinant means may comprise a direct and/or indirect approach for modifying 
expression of a nucleic acid and/or level and/or activity of a protein. 

For example, an indirect approach may comprise introducing, into a plant, a nucleic acid 
25 capable of modifying expression of the gene in question (a gene encoding a 2xC2H2 zinc 
finger protein) and or capable of modifying the level and/or activity of the protein in question (a 
2xC2H2 zinc finger protein). Examples of such nucleic acids to be introduced into a plant 
include nucleic acids encoding transcription factors or activators or inhibitors that bind to the 
promoter of a 2xC2H2 zinc finger gene or that interact with a 2xC2H2 zinc finger protein. 
30 Methods to test these types of interactions and methods for isolating nucleic acids encoding 
such interactors include yeast one-hybrid or yeast two-hybrid screens in which the 2xC2H2 
zinc finger gene/protein is used as baft. One example of such a transcription regulator is LOS2, 
described as a transcription regulator for the STZ gene. Therefore, the method of the invention 
may also be performed using LOS2, wherein expression of a 2xC2H2 zinc finger gene may be 
35 increased or further increased by decreasing expression of LOS2 in plants. 
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Also encompassed by an indirect approach for modifying expression of a 2xC2H2 zinc finger 
gene and/or for modifying level and/or activity of a 2xC2H2 zinc finger protein is the provision 
of, or the inhibition or stimulation of regulatory sequences that drive expression of a native 
2xC2H2 zinc finger gene or transgene. Such regulatory sequences may be introduced into a 
5 plant For example, the regulatory sequence to be introduced into a plant may be a promoter 
capable of driving expression of an endogenous 2xC2H2 zinc finger gene. 

A further indirect approach for modifying expression of a 2xC2H2 zinc finger gene and/or for 
modifying level and/or activity of a 2xC2H2 zinc finger protein in a plant encompasses 

10 modifying levels in a plant of a factor capable of interacting with a zinc finger protein. Such 
factors may include ligands of a 2xC2H2 zinc finger protein. Therefore, the present Invention 
also provides a method for modifying growth characteristics of a plant, comprising modifying 
expression of a gene coding for a protein which is a natural ligand of a 2xC2H2 zinc finger 
protein. Furthermore, the present invention also provides a method for modifying growth 

15 characteristics of a plant, comprising modifying expression of a gene coding for a protein which 
is a natural target/substrate of a 2xC2H2 zinc finger protein. Examples of such 
targets/substrates include stretches of DNA that are bound by the zino-finger domains. 

A direct and preferred approach on the other hand comprises introducing into a plant a nucleic 
20 acid encoding a 2xC2H2 zinc finger protein or a portion thereof or sequences capable of 
hybridising therewith, which nucleic add preferably encodes a 2xC2H2 zinc finger protein or a 
homologue, derivative or active fragment thereof. The nucleic acid may be introduced into a 
plant by, for example, transformation. 

25 Therefore, there is provided a method for modifying growth characteristics of a plant, 
comprising introducing into a plant a 2xC2H2 zinc finger nucleic acid or a portion thereof. 

The 2xC2H2 zinc finger nucleic acid may be derived (either directly or indirectly (if 
subsequently modified)) from any source provided that the sequence, when expressed in a 

30 plant, leads to modified expression of a 2xC2H2 zinc finger-encoding nucleic acid/gene and/or 
modified level and/or activity of a 2xC2H2 zinc finger protein. The 2xC2H2 zinc finger gene or 
protein may be wild type, i.e. the native or endogenous nucleic acid or polypeptide. 
Alternatively, it may be a protein or nucleic acid derived from the same or another species. The 
nucleic acid/gene may then be introduced into a plant as a transgene, for example by 

35 transformation. 
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The nucleic acid may be Isolated from a bacteria, yeast or fungi, or from a plant, algae, Insect 
or animal (including human) source. This nucleic acid may be substantially modified from its 
native form in composition and/or genomic environment through deliberate human 
manipulation. The nucleic acid is preferably obtained from a plant, whether from the same 

5 plant species in which it is to be introduced or whether from a different plant species. Further 
preferably, the nucleic acid is from a dicot, preferably from the family Brassicaceae, further 
preferably from Arabidopsls thaliana. More preferably, the nucleic acid is essentially similar to 
a nucleic acid as represented by SEQ ID NO 1, or a portion of SEQ ID NO 1, or a nucleic add 
capable of hybridising therewith or is a nucleic acid encoding an amino acid sequence 

10 essentially similar to an amino acid as represented by SEQ ID NO 2, or a homotogue, 
derivative or active fragment thereof. 

Advantageously, the methods according to the invention may also be practised using variant 
2xC2H2 zinc finger nucleic acids and variant 2xC2H2 zinc finger amino acids, preferably 
wherein the variant nucleic acids are variants of SEQ ID NO 1 and wherein the variant amino 
acids are variants of SEQ ID NO 2. Examples of variant sequences suitable in performing the 
methods of the invention include: 

(i) Functional portions of a 2xC2H2 zinc finger nucleic acid/gene; 

(ii) Sequences capable of hybridising with a 2xC2H2 zinc finger nucleic acid/gene; 

(iii) Alternative splice variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(iv) Allelic variants of a 2xC2H2 zinc finger nucleic acid/gene; 

(v) Homologues, derivatives and active fragments of a 2xC2H2 zinc finger protein. 

The abovementioned variants may also be described as being "essentially similar" to a 
2xC2H2 zinc finger nucleic acid/gene, particularly to the 2xC2H2 zinc finger encoding nucleic 
acid of SED ID NO 1, or essentially similar to a 2xC2H2 zinc finger amino acid/protein, 
particularly that of SED ID NO 2. The term "essentially similar to" also Includes variants of 
SEQ ID NO 1 in the form of a complement, DNA, RNA, cDNA or genomic DNA. The variant 
nucleic acid encoding a 2xC2H2 zinc finger protein or the variant of a 2xC2H2 zinc finger 
protein may be synthesized in whole or in part, it may be a double-stranded nucleic acid or a 
single-stranded nucleic add. Also, the term encompasses a variant due to the degeneracy of 
the genetic code; a family member of the gene or protein; and variants that are interrupted by 
one or more intervening sequences. 

35 An example of a variant 2xC2H2 zinc finger nucleic acid is a functional portion of a 2xC2H2 
zinofinger gene. Advantageously, the method according to the present invention may also be 
practised using portions of a DNA or nucleic acid encoding a 2xC2H2 zinc finger protein. A 
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functional portion refers to a piece of DNA derived or prepared from an original (larger) DNA 
molecule, which DNA portion, when expressed in a plant, gives plants having modified growth 
characteristics. The portion may comprise many genes, with or without additional control 
elements or may contain spacer sequences. The portion may be made by making one or 
5 more deletions and/or truncations to the nucleic acid. Techniques for introducing truncations 
and deletions into a nucleic acid are well known in the art. Portions suitable for use in the 
methods according to the invention may readily be determined by following the methods 
described in the Examples section by simply substituting the sequence used in the actual 
Example with the portion to be tested for functionality. 

10 

An example of a further variant 2xC2H2 zinc finger nucleic acid is a sequence that Is capable 
of hybridising to a 2xC2H2 zinc finger nucleic acid, for example to any of SEQ ID MO 1, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 41, 43, 45, 47 or 49. Advantageously, the 
methods according to the present invention may also be practised using these variants. 
15 Hybridising sequences suitable for use in the methods according to the invention may readily 
be determined for example by following the methods described in the Examples section by 
simply substituting the sequence used in the actual Example with the hybridising sequence. 

The term "hybridisation" as defined herein is a process wherein substantially homologous 

20 complementary nucleotide sequences anneal to each other. The hybridisation process can 
occur entirely in solution, i.e. both complementary nucleic acids are in solution. Tools in 
molecular biology relying on such a process include the polymerase chain reaction (PCR; and 
all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 
mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RIM As, 

25 and DNA sequence determination. The hybridisation process can also occur with one of the 
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose 
beads or any other resin. Tools in molecular biology relying on such a process include the 
isolation of poly (A+) mRNA. The hybridisation process can furthermore occur with one of the 
complementary nucleic acids immobilised to a solid support such as a nitro-cellulose or nylon 

30 membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support 
(the latter known as nucleic acid arrays or microarrays or as nucleic acid chips). Tools in 
molecular biology relying on such a process include RNA and DNA gel blot analysis, colony 
hybridisation, plaque hybridisation, in situ hybridisation and microarray hybridisation. In order 
to allow hybridisation to occur, the nucleic acid molecules are generally thermally or chemically 

35 denatured to melt a double strand into two single strands and/or to remove hairpins or other 
secondary structures from single stranded nucleic acids. The stringency of hybridisation is 
influenced by conditions such as temperature, salt concentration and hybridisation buffer 
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composition. High stringency conditions for hybridisation include high temperature and/or low 
salt concentration (salts include NaCI and Narcitrate) and/or the inclusion of fbrmamide in the 
hybridisation buffer and/or lowering the concentration of compounds such as SDS (detergent) 
in the hybridisation buffer and/or exclusion of compounds such as dextran sulphate or 

5 polyethylene glycol (promoting molecular crowding) from the hybridisation buffer. Conventional 
hybridisation conditions are described in, for example, Sambrook (2001) Molecular Cloning: a 
laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, CSH, New York, but the 
skilled craftsman will appreciate that numerous different hybridisation conditions may be 
designed in function of the known or the expected homology and/or length of the nucleic acid 

10 sequence. Sufficiently low stringency hybridisation conditions are particularly preferred (at 
least in the first instance) to isolate nucleic acids heterologous to the DNA sequences of the 
invention defined supra. An example of tow stringency conditions is 4-6x SSC / 0.1-0.5% w/v 
SDS at 37-45°C for 2-3 hours. Depending on the source and concentration of the nucleic acid 
involved in the hybridisation, alternative conditions of stringency may be employed, such as 

15 medium stringency conditions. Examples of medium stringency conditions include 1-4x SSC / 
0.25% w/v SDS at £ 45 °C for 2-3 hours. An example of high stringency conditions includes 0.1 
to 2x SSC / 0.1% wAf SDS at 60°C for 1-3 hours. The skilled man will be aware of various 
parameters which may be altered during hybridisation and washing and which will either 
maintain or change the stringency conditions. The stringency conditions may start low and be 

20 progressively increased until there is provided a hybridising nucleic acid, as defined 
hereinabove. Elements contributing to heterology include allelism, degeneration of the genetic 
code and differences in preferred codon usage. 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
25 the present invention is an alternative splice variant of a nucleic acid sequence encoding a 
2xC2H2 zinc finger protein. The term "alternative splice variant" as used herein encompasses 
variants of a nucleic acid sequence in which selected introns and/or exons have been excised, 
replaced or added. Such splice variants may be found in nature or may be manmade. Methods 
for making such splice variants are well known in the art Splice variants suitable for use in the 
30 methods according to the invention may readily be determined for example by following the 
methods described in the Examples section by simply substituting the sequence used in the 
actual Example with the splice variant 

Another variant 2xC2H2 zinc finger nucleic acid useful in practising the methods according to 
35 the present invention is an allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger 
protein. Allelic variants exist in nature and encompassed within the methods of the present 
invention is the use of these natural alleles. Allelic variants also encompass Single Nucleotide 
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Polymorphisms (SNPs), as well as Small Insertion/Deletion Polymorphisms (INDELs). The size 
of INDELs is usually less than 100 bp. SNPs and INDELs form the largest set of sequence 
variants in naturally occurring polymorphic strains of most organisms. Allelic variants suitable 
for use In the methods according to the invention may readily be determined for example by 
5 following the methods described in the Examples section by simply substituting the sequence 
used in the actual Example with the allelic variant 

The present invention provides a method for modifying plant growth characteristics, comprising 
modifying expression in a plant of an alternative splice variant or expression in a plant of an 
10 allelic variant of a nucleic acid encoding a 2xC2H2 zinc finger protein and/or by modifying level 
and/or activity in a plant of a 2xC2H2 zinc finger protein encoded by the alternative splice 
variant or allelic variant 

Examples of variant 2xC2H2 zinc finger proteins useful in practicing the methods of the 
15 present invention are homologues, derivatives or functional fragments of a 2xC2H2 zinc finger 
protein. 

"Homologues" of a 2xC2H2 zinc finger protein encompass peptides, oligopeptides, 
polypeptides, proteins and enzymes having amino acid substitutions, deletions and/or 

20 insertions relative to the unmodified protein in question and having similar biological and 
functional activity as the unmodified protein from which they are derived. To produce such 
homologues, amino acids of the protein may be replaced by other amino acids having similar 
properties (such as similar hydrophobicity, hydrophilicity, antigenicity, propensity to form or 
break o-helical structures or p-sheet structures). Conservative substitution tables are well 

25 known in the art (see for example Creighton (1984) Proteins. W.H. Freeman and Company). 
The homologues useful in the method according to the invention have at least in increasing 
order of preference 30%, 31 %, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41 %, 42%, 
43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 52%, 54%, 56%, 58%, 60%, 62%, 64%, 66%, 
68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 86%, 88%, 90%, 92%, 94%, 96%, 98% 

30 sequence identity or similarity to an unmodified protein. 

The percentage of identity may be calculated by using an alignment program well known in the 
art For example, the percentage of identity may be calculated using the program GAP, or 
needle (EMBOSS package) or stretcher (EMBOSS package) or the program align X, as a 
35 module of the vector NTI suite 5.5 software package, using the standard parameters (for 
example GAP penalty 5, GAP opening penalty 15, GAP extension penalty 6.6). 
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According to another embodiment of the present invention, the nucleic acid sequence useful in 
the methods of the present invention is a nucleic acid encoding a protein homologous to SEQ 
ID NO 2. 

Methods for the search and identification of 2xC2H2 zinc finger protein homologues, for 
example STZ zinc finger homologues, would be well within the realm of a person skilled in the 
art. Such methods, involve screening sequence databases with the sequences provided by the 
present invention, for example SEQ ID NO 2 (or SEQ ID NO 1), preferably in a computer 
readable format This sequence information may be available in public databases, that include 
but are not limited to Genbank (htto://www.ncbl.nlm.nih < QOv/web/Genbank> . the European 
Molecular Biology Laboratory Nucleic acid Database (EMBL) (http:/w.ebi.ac.uk/ebi-docs/embl. 
db.html) or versions thereof or the MIPS database (http://mips.gsf.de/). Different search 
algorithms and software for the alignment and comparison of sequences are well known in the 
art. Such methods Include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the 
algorithm of Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of 
two complete sequences that maximises the number of matches and minimises the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The suite of programs referred to as 
BLAST programs has 5 different Implementations: three designed for nucleotide sequence 
queries (BLASTN, BLASTX, and TBLASTX) and two designed for protein sequence queries 
(BLASTP and TBLASTN) (Coulson, Trends in Biotechnology: 76-80, 1994; Birren et al., 
GenomeAnalysis, 1: 543, 1997). The software for performing BLAST analysis is publicly 
available through the National Centre for Biotechnology Information. 

Default blast parameters to find useful homologues of any of SEQ ID NO 1, SEQ ID NO 2 or 
any of SEQ ID NO 10 to SEQ ID NO 50, are, when comparing nucleotide sequence G (Cost to 
open a gap) 5, E (Cost to extend a gap default) 2, q (Penalty for a mismatch) -3, r (Reward for 
a match) 1, e (Expectation value (E)) 10.0, W (Word size) 11, V (Number of one-line 
descriptions) 100 and B (Number of alignments to show) 100. When comparing protein 
sequences, the default parameters are preferably G 11, E 1, e value 10.0, W 3, V 100 and B 
100. 

The above-mentioned analyses for comparing sequences, for the calculation of sequence 
identity and for the search for homologues, is preferentially done with full-length sequences or 
within a conserved region of the sequence. Therefore, these analyses may be based on a 
comparison of certain regions such as conserved domains, motifs or boxes. 
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The identification of such domains or motifs for examples the motif and boxes as represented 
by SEQ ID NO 5, 6, 7, 8 and 9, would also be well within the realm of a person skilled in the art 
and involves for example, a computer readable format of proteins of the present invention, the 
use of alignment software programs and the use of publicly available information on protein 
domains, conserved motifs and boxes. This protein domain information is available in the 
PRODOM (http.7/www.blochem.uc^ PIR 
(http://pir.georgetown.edu/) or pFAM (http://pfam.wusti.edu/) database. For the identification of 
Zinc finger domains, such as the 2xC2H2 zinc finger domain, pFAM is preferred. Sequence 
analysis programs designed for motif searching may be used for identification of fragments, 
regions and conserved domains as mentioned above. Preferred computer programs would 
Include but are not limited to MEME, SIGNALSCAN, and GENESCAN. A MEME algorithm 
(Version 3.0) may be found in the GCG package; or on the Internet site 
http://www.sdsc.edu/MEME/meme. SIGNALSCAN version 4.0 information is available on the 
Internet site http://biosci.cbs.umn.edu/software/sigscan.html. GENESCAN may be found on 
the Internet site http://gnomic.stanford.edu/GENESCANW.html. 

At present, zinc finger motifs are subdivided in more than 40 different classes as can be found 
in the Pfam database of protein families present at the Sanger institute 
(http://www.sanger.ac.Uk/Software/Pfam/browse/Z.shtml). 

The C2H2 zinc finger (Zf-C2H2) motif is the classical zinc finger domain. It was first recognized 
in the transcription factor IIIA (TFIIIA) of Xenopus (Miller at al. 1985). The domain is typically 
25 to 30 amino-acid residues in length. The following pattern describes the zinc finger *X-C- 
X(1-5)-C-X3-*X5-*X2-H-X(3-6)-[H/C] where X can be any amino acid, and numbers in brackets 
indicate the number of residues. The positions marked * are those that are important for the 
stable folding of the zinc finger. The final position can be either his or cys, while still being a 
C2H2 zinc finger domain. In view of recent publications on the design of zinc finger domains it 
becomes feasible also to replace one or more of the Cys or His amino acids, whilst still 
retaining the original functionality of the C2H2 domain. The residues separating the second 
Cys and the first His are mainly polar and basic. The canonical C2H2 zinc finger is composed 
of two short beta strands followed by an alpha helix. DNA binding of the zinc finger motif is 
mediated by amino terminal part of the alpha helix which binds the major groove in DNA 
binding zinc fingers. C2H2 domains have been shown to interact with RNA, DNA and proteins. 
The tetracoordination of a Zinc ion by the conserved cystein and histidine residues determines 
the conserved tertiary structure of the motif. Conserved hydrophobic residues are commonly 
found at positions -2 and also at 4 amino acids after the second cystein (that participates in 
zinc binding) and at position three before the first histidine (that participates in zinc binding). In 
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plant mutti zinc finger proteins, spacing between the C2H2 domains Is generally about 15 to 
about 65 amino adds. 

Thus, plant zinc finger proteins are characterized by long spacers of diverse lengths between 
5 adjacent fingers. Moreover, they are characterised by a highly conserved sequence of six 
amino adds, located within a putative DNA-contacting surface of each finger. Two forms of 
such conserved sequence are most commonly found in plant C2H2 zinc fingers, the QALGGH 
(SEQ ID NO 5) and the NNM/WQMH (SEQ ID NO 6). Despite the high sequence conservation 
of the QALGGH, some variants or the so-called 'modified type' occur in nature where one or 
10 two amino adds can have a different form, most typically the +1 "Q w can be a *GY K" or "R" 
(these amino acids share the same turn-like characteristic), the +2 "A" can be "S" (both of 
which share the characteristic of being small amino acids) or the +3" a L can be "P (these two 
amino acids are both hydrophobic). The QALGGH-motif as used herein comprises all these 
variants. In the NNM/WQMH motif at position 3 there is mostly an a W or a "W. 
15 Therefore, the present invention provides a method as described hereinabove, wherein said 
2xC2H2 zinc finger protein comprises a QALGGH motif. Further, The present invention 
provides a s described hereinabove, wherein said 2xC2H2 zinc finger protein comprises a 
NNM/WQMH motif. 

20 According to one embodiment of the invention, both C2H2 domains are of the same type. More 
preferably, both C2H2 zinc finger domains have the same conserved GALGGH or 
NNM/WQMH motif. According to another embodiment, each C2H2 zinc finger domain has a 
different conserved motif. 

25 Accorcling to one embodiment, the 2xC2H2 protein useful in the methods of the present 
invention is characterized by an EAR motif, which is an ERF-Associated amphiphilic 
repression motif. This motif has been recognized in two unrelated types of transcription factors, 
namely the ERF transcription factors of the AP2 type and in the zinc finger transcription 
factors. In the latter class, the EAR motif is generally located at the C-terminus of the protein. 

30 The pattern for the EAR motif has the conserved sequence hDLNh(X)P (SEQ ID NO 7), where 
tt h w is a hydrophobic residue (any one of A,C,F,G,H,I,K,L,M,R,T,V,W,Y) most typically UF/I and 
where "X" can be one (any amino acid) or no amino acid. A characteristic feature of the EAR 
motif is the alternation of hydrophilic and hydrophobic residues with the aspartic acid (D) 
residue being amphiphilic. Ohta et aL (The plant cell, 2001, 13, p1 959-1 968), which reference 

35 is cited herein by reference, previously characterized EAR motifs present in 2xC2H2 zinc 
finger proteins. 
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Therefore, the present invention provides a method as described hereinabove, wherein the 
2xC2H2 zinc finger protein comprises an EAR motif. According to one embodiment, the EAR 
motif is located in the Oterminal region of the protein, preferably between the second zinc 
finger domain and the C-terminus. 

5 

According to a further embodiment, the zinc finger proteins used in the methods of the present 
invention have two zinc finger domains and a nuclear localization signal (B-box). A cluster of 
basic amino acids that resembles the B-box (Basic box) were described by Chua et al. (EMBO 
1992- 11, 241-9) and were hypothesized to be a nuclear localization signal for the protein. 

10 These have been recognized in 2xC2H2 proteins (Sakamoto et al., Gene 248 (2000) 23-32). 
The cluster is rich in Lysine (K) and Arginine (R) residues. A consensus sequence defining the 
most frequent form of the B-box in 2xC2H2 genes is KR(S)KRXR (SEQ ID NO 8) where "S" at 
the 3rd position may be absent or present However other variants may occur in nature that 
still retain the characteristic of being a charged region rich in basic amino acids. The location of 

15 the basic box is most frequently at the N-terminus of the protein, but can also occur in other 
locations. It has been speculated that due to its basic nature the B-box could also participate in 
DNA binding. 

Accordingly, the present invention provides a method as described hereinabove, wherein the 
20 2xC2H2 zinc finger protein further comprises a B-box. According to one embodiment the B-box 
is located in the N-terminal region of the zinc finger protein. Preferably the proteins useful in 
the methods of the present invention have a B-box located between the N-terminus and the 
first zinc finger domain. 

25 According to a further embodiment, the zinc finger proteins useful in the methods of the 
present invention have two C2H2 zinc finger domains and an L-box. A conserved motif, named 
L-box, of yet unknown function has been identified in 2xC2H2 proteins and has been described 
previously by Sakamoto et al. (Gene 248 (2000) 23-32). The L-box is typically located at the N- 
terminus, between the B-box and the first C2H2 zinc finger. The L-box is represented by the 

30 sequence EXEXXAXCLXXL (SEQ ID NO 9). This region may be involved in protein-protein 
interactions. Zinc finger proteins lacking the L-box, may for example have serine rich regions at 
a similar position, which regions are putative sites for protein-protein interactions. 

Therefore, the present invention provides a method as described hereinabove, wherein the 
35 2xC2H2 protein comprises an L-box. 
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Particular zinc finger homologues useful in the methods of the present invention have one or 
more of the conserved motifs as depicted in SEQ ID NO 5, 6, 7, 8 and 9, or motife that are 
80% identical to these motifs or motifs that have conserved substitutions of amino acids. The 
2xC2H2 protein as set forth in SEQ ID NO 2 comprises all the boxes as set forth in SEQ ID NO 
5 5, 7, 8 and 9. All its paralogues and orthologues also comprise all of these boxes. 

Homologues of a 2xC2H2 protein as presented in SEQ ID NO 2 and isolated from Arabidopsis 
thaliana, that are useful in the constructs and the methods of the present invention are also 
identified in other plant species. 

10 

Two special forms of homologue, orthologues and paralogues, are evolutionary concepts used 
to describe ancestral relationships of genes. The term "paralogue" relates to a gene- 
duplication within the genome of a species leading to paralogous genes. The term "orthologue" 
relates to a homologous gene in different organisms due to ancestral relationship. The term 
15 "homologue" as used herein also encompasses paralogues and orthologues of the proteins 
useful in the methods according to the invention. 

Othologues in other plant species may easily be found by performing a so-called reciprocal 
blast search. Orthologous genes can be identified by querying one or more gene databases 

20 with a query gene or protein of interest (SEQ ID NO 1 or 2), using for example BLAST 
program. The highest-ranking subject genes that result from the search are then again 
subjected to a BLAST analysis, and only those subject genes that match again with the query 
sequence (SEQ ID NO 1 or 2) are retained as true orthologous genes. For example, to find a 
rice orthologue of an Arabidopsis thaliana gene, one may perform a BLASTN or TBLASTX 

25 analysis on a rice database such as (but not limited to) the Oryza sativa Nipponbare database 
available at the NCBI website (http://www.ncbi.nlm.nih.gov) or the genomic sequences of rice 
(cultivars indica or japonica). In a next step, the obtained rice sequences are used in a reverse 
BLAST analysis using an Arabidopsis database. The results may be further refined when the 
resulting sequences are analysed with ClustalW and visualised in a neighbour joining tree. 

30 The method can be used to identify orthologues from many different species. 

The closest homologues in other species (orthologues of the protein of SEQ ID NO 2), include 
those from a variety of dicot and monocot plants, for example from Datisca glomerata 
(AF119050_1, AAD26942, SEQ ID NO 10 and 11), from soybean (T09602, SCOF-1, SEQ ID 
35 NO 12 and 13), Medicago sativa (CAB77055.1, SEQ ID NO 14 and 15), from tobacco 
(T01985, SEQ ID NO 16 and 17) from rice, (AF332876JI, AAK01713.1, SEQ ID NO 18 and 
19), from petunia (BAA05079.1, SEQ ID NO 20 and 21), from wheat (S39045 and BAA03901, 
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WZF1, SEQ ID NO 22 and 23), from Capsicum annum (SEQ ID NO 24 and 25), from turnip 
(T14408, T14409) and from sugarcane (CA279020). 

Close homologues of the same species (paralogues of the protein of SEQ ID NO 2 from 

5 Arabldopsis thaliana) are described below. 

The MIPS database contains the sequence of the Arabldopsis thaliana genome with prediction 
and functional annotation of the proteins encoded. Searching this database with the STZ gene 
of SEQ ID NO 1 (MIPS accession number At1g27730), showed that in the Arabldopsis 
genome there are 2 genes encoding very close homologues of SEQ ID NO 2, At5g43170 

10 (NMJI23683, SEQ ID NO 32 and 33) and At5g04340 (NM_120516 SEQ ID NO 28 and 29), 
and 3 others with high similarity: At3gl9580 (NM_1 12848, SEQ ID NO 26 and 27), At5g67450 
(NMJI26145, SEQ ID NO 34 and 35) and At3g49930 (NM_1 14853, SEQ ID NO 30-31). These 
genes are spread over 3 chromosomes, 1, 3 and 5. Similarly, a number of paralogues of the 
orthologue in Petunia have been isolated and sequenced. Advantageously, paralogues from 

15 the same species may be used in the methods of the present invention. 

Furthermore, a number of family members of the STZ protein of SEQ ID NO 2 have been 
found in Arabisopsis. The STZ gene and protein of SEQ ID NO 1 and 2 have been previously 
published In the database under the MIPS accession number At1g27730 or in Genbank under 

20 the accession numbers NPJI 74094.1, X95573 or CAA64820. Additionally, several other 
cDNA's, isolated from other tissues or at different developmental stages of Arabisopsis have 
been reported and encode the same protein as that of SEQ ID NO 2. S uch sequences 
sequences deposited under the Genbank accession number AY034998, NM_1 02538, 
AC12375, X95573, AY063006, X98671, X98670, or AF250336. These isolates illustrate the 

25 differential expression of the STZ gene in different plant tissues at different developmental 
stages. The differential regulation of these different cDNA's is reflected by the differences at 
their 5'UTR and the 3'UTR regions, white the encoded protein remains the same. 
Advantageously, the members of the same gene family as SEQ ID NO 1 or the members of 
the same family of any of the orthologues of SEQ ID NO 1 , may be used in the methods of the 

30 present invention. 

Other close homologues useful in the methods of the present invention are the sequences as 
deposited in the public database under the following accession numbers, which sequences are 
herein incorporated by reference: homologues isolated from Petunia: BAA21 923.1, 
35 BAA21922.1, BAA21926.1, BAA21925.1, BAA19110.1, BAA19926.1, BAA21924.1, 
BAA19111.1, BAA21921.1, BAA19114.1, BAA05076.1, BAA05079.1, CAA43111.1, 
BAA21920.1, BAA21919.1, BAA05077.1, BAA05078.1, BAA20137.1; homologues isolated 
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from Arabidopsis: CAA67229.1, BAC43454.1, NPJI96054.1, AAM67193.1, NPJI99131.1, 
NP_188592.1, NP_201546.1, NP_190562.1, NP_182037.1, BAC43008.1, Q8VWG3, 
CAC86393.1, CAC86168.1, CAC86167.1, CAC86166.1, CAB67667.1, CAC01747.1, 
CAB90936.1, CAB90935.1, CAB80245.1, CAB41188.1, CAA18741.1, CAA67234.1, 
5 CAA67236.1, GAA67231.1, CAA67230.1, CAA67228.1, CAA67235.1, CAA67233.1, 
CAA67232.1, CAA67229.1, CAA64820.1 and homologues isolated from rice: BAB16855.1, 
AAO06972.1 , CAC09475.1 , BAB63718.1 f P0683F02.21 , BAB67885. 1 , P0031 D1 1 .19, 
BAB64114.1, AAK01713.1, AF332876_1, AAL76091.1, BAB67879.1, P0031D11.12 and 
BAC15513.1. 

10 

A phylogenetlc tree may be constructed with all the homologues, paralogues and orthologues 
are defined herein above. Multiple alignment q may be made using clustal W present in the 
VNTi (version 5.0) program with for example Gap opening penalty 10 and Gap extention 5. For 
making a phylogenetic tree the Phylic software package available at 
15 http://evolution.Qenetics-washington.edu/DhvliD.html may be used. Sequences clustering 
around SEQ ID NO 1 or SEQ ID NO 2, Identify genes or proteins suitable for use in the 
methods of the present invention. 

The sequence of SEQ ID NO 2 and its rice orthologue AF332876 (SEQ ID NO 19) have 36% 
20 sequence identity when using the program Needle with the parameters Gap penalty 5 and Gap 
extension penalty 6. Therefore, homologues particularly useful in the methods of the present 
invention are homologues having 36% or more sequence identity with the 2xC2H2 zinc finger 
protein as presented in SEQ ID NO 2 or having 36% or more sequence identity to the closest 
orthologue of SEQ ID NO 2 from another species. 

25 

Preferred homologues useful in practicing the methods of the present invention are plant 
homologues, i.e. proteins obtained from a plant nucleic acid. More preferably, the nucleic acid 
sequence is from a dicot, more preferably from the family Brassicaceae, further preferably from 
Arabidopsis thatiana. 

30 

Preferably the 2xC2H2 zinc finger protein useful in the methods of the present invention 
belongs to the same gene family as the salt tolerant zinc finger protein (STZ) of Arabidopsis 
thaliana, or is a homologues thereof. The name ZAT10 can also be used to identify the STZ 
zinc finger protein of Arabidopsis thaliana. 

35 

Another variant of a zinc finger protein useful in the methods of the present invention is a 
substitutional variant The term "Substitutional variants* of a protein refers to those variants in 
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which at least one residue in an amino acid sequence has been removed and a different 
residue inserted in its place. Amino add substitutions are typically of single residues, but may 
be dustered depending upon functional constraints placed upon the polypeptide; insertions will 
usually be of the order of about 1-10 amino add residues, and deletions will range from about 
5 1-20 residues. Preferably, amino acid substitutions comprise conservative amino acid 
substitutions. Particular substitutional variants of the C2H2 zinc finger protein are substitutional 
variants in which one or more of the conserved Cys and/or His residues is replaced, whilst 
retaining the same zinc finger fundionality. To retain the same functionality, the residues 
around these conserved Cys of His residues may also be substituted. 

10 

"Insertional variants" of a protein are those in which one or more amino add residues are 
introduced into a predetermined site in said protein. Insertions can comprise amino-terminal 
and/or carboxy-terminal fusions as well as intra-sequence insertions of single or multiple amino 
acids. Generally, insertions within the amino acid sequence will be smaller than amino- or 

15 carboxy-terminal fusions, of the order of about 1 to 10 residues. Examples of amino- or 
carboxy-terminal fusion proteins or peptides indude the binding domain or activation domain of 
a transcriptional activator as used in the yeast two-hybrid system, phage coat proteins, 
(histidine)e-tag, glutathione S-transferase-tag, protein A, maltose-binding protein, dihydrofolate 
reductase, Tag«100 epitope, c-myc epitope, FLAG®-epitope, lacZ, CMP (calmodulin-binding 

20 peptide), HA epitope, protein C epitope and VSV epitope. 

"Deletion variants" of a protein are characterised by the removal of one or more amino acids 
from the protein. Amino acid variants of a protein may readily be made using peptide synthetic 
techniques well known in the art, such as solid phase peptide synthesis and the like, or by 

25 recombinant DNA manipulations. Methods for the manipulation of DNA sequences to produce 
substitution, insertion or deletion variants of a protein are well known in the art. For example, 
techniques for making substitution mutations at predetermined sites in DNA are well known to 
those skilled in the art and indude M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, 
Cleveland, OH), QuickChange Site Directed mutagenesis (Stratagene, San Diego, CA), PCR- 

30 mediated site-directed mutagenesis or other site-directed mutagenesis protocols. 

The term "derivatives" refers to peptides, oligopeptides, polypeptides, proteins and enzymes 
which may comprise substitutions, deletions or additions of naturally and non-naturally 
occurring amino acid residues compared to the amino acid sequence of a naturally-occurring 
35 form of the 2xC2H2 protein such as for example the 2xC2H2 zinc finger protein as presented 
in SEQ ID NO 2. "Derivatives* of a 2xC2H2 zinc finger protein encompass peptides, 
oligopeptides, polypeptides, proteins and enzymes which may comprise naturally occurring 
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altered, glycosylated, acylated or non-naturally occurring amino acid residues compared to the 
amino acid sequence of a naturally-occurring form of the polypeptide, A derivative may also 
comprise one or more non-amino acid substituents compared to the amino acid sequence from 
which it is derived, for example a reporter molecule or other ligand, covalently or non- 
5 covalently bound to the amino acid sequence such as, for example, a reporter molecule which 
is bound to facilitate its detection, and non-naturally occurring amino acid residues relative to 
the amino acid sequence of a naturally-occurring protein. 

Another variant of a 2xC2H2 zinc finger protein useful in the methods of the present invention 
10 is an active fragment of a zinc finger protein. "Active fragments 0 of a 2xC2H2 zinc finger 
protein encompasses at least five contiguous amino acid residues of a protein, which residues 
retain similar biological and/or functional activity to the naturally occurring protein. For 
example, useful fragments comprise at least 10 contiguous amino acid residues of a 2xC2H2 
zinc finger protein. Other preferred fragments are fragments of a 2xC2H2 zinc finger protein 
15 starting at the second or third or further internal methionin residues. These fragments originate 
from protein translation, starting at internal ATG codons. Functional fragments of a 2xC2H2 
zinc finger protein usefol in practising the methods of the present invention may have one, two 
or no C2H2 domains, without affecting its functionality in the methods of the present invention. 

20 According to a preferred feature of the present invention, enhanced or increased expression of 
a nucleic acid encoding a 2xC2H2 zinc finger protein is envisaged. Methods for obtaining 
enhanced or increased expression of genes or gene products are well documented in the art 
and include, for example, over-expression driven by a strong promoter, the use of transcription 
enhancers or translation enhancers. The term over-expression as used herein means any form 

25 of expression that is additional to the original wild-type expression level. Preferably the nucleic 
acid to be introduced into the plant and/or the nucleic acid that is to be overexpressed in the 
plant is in the sense direction with respect to the promoter to which it is operably linked. 
Preferably, the nucleic acid sequence represented by SED ID NO 1 is over-expressed in a 
plant However, it should be clear that the applicability of the invention is not limited to use of 

30 the nucleic add represented by SEQ ID NO 1 nor to the nucleic acid sequence encoding the 
amino acid sequence of SEQ ID NO 2, but that other nucleic acid sequences encoding 
homologues, derivatives or active fragments of SED ID NO 1 or SED ID NO 2 may be useful in 
the methods of the present invention. Examples of nucleic acids or proteins are provided in 
SEQ ID NO 10 to SEQ ID NO 50. 

35 

Alternatively and/or additionally, increased expression of a 2xC2H2 encoding gene or 
increased level and/or activity of a 2xC2H2 protein in a plant cell, is achieved by mutagenesis. 
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For example these mutations may be responsible for altered control of the 2xC2H2 gene, 
resulting in more expression of the gene, relative to the wild-type gene. Mutations can also 
cause conformational changes in a protein, resulting in more activity and/or higher levels of the 
2xC2H2 protein. 

5 

Modifying gene expression (whether by a direct or indirect approach) encompasses altered 
transcript levels of a gene. Altered transcript levels may be sufficient to induce certain 
phenotypic effects, for example via the mechanism of cosuppression. Here the overall effect of 
introduction of a transgene is that there is less activity in the cell of the protein encoded by a 

10 native gene having homology to the introduced transgene. Therefore, according to another 
embodiment of the present invention, there is provided a method for modifying growth 
characteristics in a plant, comprising decreasing expression of a gene encoding a 2xC2H2 zinc 
finger protein or decreasing level and/or activity of a 2xC2H2 zinc finger protein. Examples of 
decreasing expression, level and/or activity of a protein in a cell are well documented in the art 

15 and include, for example, downregulation of expression by anti-sense techniques, RNAi 
techniques, small interference RNAs (siRNAs) and microRNA (miRNA) . 

Another method for downregulation of gene expression or gene silencing comprises use of 
ribozymes, for example as described in Atkins et al. 1994 (WO 94/00012), Lenee et al. 1995 
20 (WO 95/03404), Lutziger et al. 2000 (WO 00/00619), Prinsen et al. 1997 (WO 97/3865) and 
Scott et al. 1997 (WO 97/38116). 



Gene silencing may also be achieved by insertion mutagenesis (for example, T-PNA insertion 
or transposon insertion) or by gene silencing strategies as described by, among others, Angell 
25 and Baulcombe 1998 (WO 98/36083), Lowe et al. 1989 (WO 98/53083), Lederer et al. 1999 
(WO 99/15682) or Wang et al. 1999 (WO 99/53050). 

Expression of an endogenous gene may also be reduced if it contains a mutation. Such a 
mutation or such a mutant gene may be isolated and introduced into the same or different 
30 plant species in order to obtain plants having modified growth characteristics. Examples of 
such mutants are dominant negative mutants of a 2xC2H2 zinc finger gene. 

Genetic constructs aimed at silencing gene expression may comprise the 2xC2H2 zinc finger 
nucleic acid, for example as represented by SEQ ID NO 1 (or one or more portions thereof or a 
35 sequence capable of hybridising therewith), in a sense and/or antisense orientation relative to 
the promoter sequence. The sense or antisense copies of at least part of the endogenous 
gene in the form of direct or inverted repeats may also be utilised in the methods according to 
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the invention. The growth characteristics of plants may also be modified by introducing into a 
plant at least part of an antisense version of the nucleotide sequence represented by SEQ ID 
N01. 

5 According to a further embodiment of the present invention, genetic constructs and vectors to 
facilitate introduction and/or to facilitate expression of the 2xC2H2 zinc finger nucleotide 
sequences useful in the methods according to the invention are provided. Therefore, according 
to the present invention, there is provided a construct comprising: 

Q) a nucleic acid capable of modifying expression of a nucleic acid encoding a 

10 2xC2H2 zinc finger protein and/or modifying level and/or activity of a 2xC2H2 zinc finger 
protein; 

(ii) one or more control sequence capable 'of driving expression of the nucleic acid 
sequence of (i); and optionally 

(iii) a transcription termination sequence. 

15 

Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transforming 
into plants and suitable for expression of the gene of interest in the transformed cells. 
20 Preferably the genetic construct is a plant expression vector. 

The nucleic acid according to (i) is advantageously any of the nucleic acids described 
hereinbefore. A preferred nucleic acid is the nucleic acid represented by SEQ ID NO 1 or a 
variant thereof as hereinbefore defined, or is a nucleic acid sequence encoding a sequence 
25 represented by SEQ ID NO 2 or a variant as hereinbefore defined. For example such variants 
encode a protein as presented in any of SEQ ID NO 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 42, 44, 46, 48 and 50. 

The terms "regulatory element" and "control sequence" are used herein interchangeably and 
30 are to be taken in a broad context to refer to regulatory nucleic acids capable of effecting 
expression of the sequences to which they are operably linked. Encompassed by the 
aforementioned terms are promoters. A "promoter* encompasses transcriptional regulatory 
sequences derived from a classical eukaryotic genomic gene (including the TATA box which is 
required for accurate transcription initiation, with or without a CCAAT box sequence) and 
35 additional regulatory elements (i.e. upstream activating sequences, enhancers and silencers) 
which alter gene expression in response to developmental and/or external stimuli, or in a 
tissue-specific manner. Also included within the term is a transcriptional regulatory sequence 
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of a classical prokaryotlc gene, in which case it may include a -35 box sequence and/or -10 
box transcriptional regulatory sequences. The term "regulatory elemenf also encompasses a 
synthetic fusion molecule or derivative which confers, activates or enhances expression of a 
nucleic acid molecule In a cell, tissue or organ. The term "operably linked" as used herein 
5 refers to a functional linkage between the promoter sequence and the gene of interest, such 
that the promoter sequence is able to initiate transcription of the gene of interest. Preferably, 
the gene of interest is operably linked to a promoter in a sense direction. 

Advantageously, any type of promoter may be used to drive expression of the nucleic acid 
10 sequence depending on the desired outcome. 

Promoters useful for the present invention are described in EP 03075331.3, which promoters 
and sequences are incorporated herein by reference. 

Other examples of preferred promoters are presented in Table I (a) to (c), which promoters or 
15 derivatives thereof are useful in the methods and/or in making the constructs of the present 
invention. Accordingly, genetic constructs comprising of the nucleic acids of (i), for example a 
2xC2H2 nucleic acid, and at least part of a promoter from Table I (a) to (c) or from EP 
03075331.3, preferably, wherein said parts are operably linked, are also provided by the 
present invention. 

20 

According to onother embodiment, the nucleic acid of (i) is operably linked to a constitutive 
promoter. The term "constitutive 1 ' as defined herein refers to a promoter that is expressed 
substantially continuously. Furthermore, preferably the constitutive promoter is a ubiquitous 
promoter, which is expressed in more than one, preferably in most or substantially all tissues 

25 of the plant. Preferably, the constitutive promoter to be used in the methods of the present 
invention, or cloned in the genetic constructs of the present invention, is a plant promoter, 
preferably a constitutive promoter, such as a GOS2 promoter or a promoter with similar 
strength and/or similar expression pattern. Preferably plant promoters derived from a plant 
nucleic acid are used. Alternatively, promoters operable in plant, such as promoters derived 

30 from plant pathogens are used. 

According to another embodiment of the invention, the nucleic acid of (i) is operably linked to a 
plant promoter, preferably a tissue-preferred promoter. The term "tissue-preferred" as used 
herein refers to a promoter that is expressed predominantly in at least one tissue or organ. For 
35 example, the tissue-preferred promoter is a seed-preferred promoter, such as a pWS18 
(Joshee et al. Plant Cell Physiol. 1998 Jan;39(1):64-72. ) or a promoter of similar strength 
and/or similar expression pattern. 
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Promoters with similar strength and/or similar expression pattern may be found by coupling the 
promoter to a reporter gene and checking the function of the reporter gene in different tissues 
of a plant. One suitable reporter gene is beta-glucuronidase and the colorimetric GUS staining 
5 to visualize the beta-glucuronidase activity in a plant tissue is well known to a person skilled in 
the art 

Table I (a): flower preferred promoters useful in the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
10 reference. 



Gene 


Expression 


Reference 


AtPRP4 


flowers 


http://salus.medium.edu/mmg/tiemey/htrnl 


chalene synthase (chsA) 


flowers 


Van der Meer, et al., Plant Mol. Biol. 15, 95- 
109, 1990. 


LAT52 


anther 


Twel! etal Mol. Gen Genet 217:240-246 
(1989) 




flowers 





Table I (b): seed-preferred promoters useful in the present invention. Sequences of these 
15 promoters are described in the cited reference, which sequences are herein incorporated by 
reference. 



Gene 


Expression 


Reference 


seed-specific genes 


seed 


Simon, et aL, Plant Mol. Biol 5: 191, 1985; 
Scofield, et a/., J. Biol Chem. 262: 12202, 
1987.; Baszczynski, etaL, Plant MoL Biol. 14: 
633, 1990. 


Brazil Nut albumin 


seed 


Pearson, et al., Plant Mol. Biol. 18: 235-245, 
1992. 


legumln 


seed 


Ellis, et al., Plant Mol. Biol. 10: 203-214, 
1988. 


glutelin (rice) 


seed 


Takaiwa, et al., Mol. Gen. Genet. 208: 15-22, 
1986; Takaiwa, et aL. FEBS Letts. 221: 43- 
47, 1987. 


zein 


seed 


Matzke et al Pfant Mol Biol, 14(3):323-32 
1990 


napA 


seed 


Staiberg, etal, Planta 199: 515-519, 1996. 
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uuhoAt 1 MW and HMW d lit en in- 1 


AnriosDSfm 

Kyi luuopoi i ■ » 


Mol Gen Genet 216:81-90, 1989; NAR 
17:481-2, 1989 


Wiloal Orn 


cppH 
oocu 


Albani et al, Plant Cell, 9: 171-184, 1997 


wneat oc, p, Y-giiaams 


CI lUUwpCI III 


EMBO 3*1409-15 1984 


barley Itrl promoter 


endosperm 




barley B1, C, D, nordein 


endosperm 


Thanr Annl f*2on OQ ■ «i ^_f>^ 1QQQ* Plant I 

i neor /\ppi oen ao.i zoo-o^, 1 999, i-iani a 
4:343-66, 1993; Mol Gen Genet 250:750-60, 

1QQR 


barley DOF 


endosperm 


Mena et al, The Plant Journal, 116(1): 53-62, 

1QQA 


bfz2 


endosperm 


EP991 06056.7 


synthetic promoter 


endosperm 


\/1r*»n+ek.r^2»rhsiirkcei a/ a/ Plant 1 i3 m R9Q . 

640, 1998. 


rice prolamin NRP33 


enaosperm 


Wn of al Plant Cell Phx/^lolnav 39fR^ 885- 
vvu w elf, • icti it wii n lyoi v«w»y y o / oovr 

OOV| 1 TOO 


nee cc-globulin Glb-1 


onrl rvonarm 

ena uspemi 


Wlj of a/ Plant Cell Phvslotoav 39(8) 885- 
889, 1998 


riceOSHI 


embryo 


Sato ef a/, Proc. Natl. Acad. Sci. USA, 93: 
8117-8122,1996 


rice orglobulin REB/OHP-1 


endosperm 


Nakase et al. Plant Mol. Biol. 33: 513-522, 
1997 


rice ADP-glucose PP 


endosperm 


Trans Res 6:157-68, 1997 


maize ESR gene family 


endosperm 


Plant J 12:235-46, 1997 


sorgum vkafirin 


endosperm 


PMB 32:1029-35, 1996 ; 


KNOX 


embryo 


Postma-Haarsma et al, Plant Mol. Biol. 
39:257-71,1999 


rice oleosin 


embryo and 
aleuron 


Wu et at, J. Biochem., 123:386, 1998 


sunflower oleosin 


seed (embryo 
and dry seed) 


Cummins, et a/., Plant Mol. Biol. 19: 873-876, 
1992 



Table I (c): constitutive promoters useful In the present invention. Sequences of these 
promoters are described in the cited reference, which sequences are herein incorporated by 
reference. 



Gene 


Expression 


Reference 


Actin 


constitutive 


McElroy etal, Plant Cell, 2: 163-171, 
1990 


CAMV35S 


constitutive 


Odell et al, Nature, 313: 810-812, 
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1985 


CaMV19S 


constitutive 


Nilsson et al., Physiol. Plant 
700:456-462, 1997 


GOS2 


constitutive 


de Pater etal, Plant J Nov;2(6):837- 
44, 1992 


ubiquitin 


constitutive 


Christensen et a/, Plant Mol. Biol. 

4 A* R7R_AAQ 4QQ9 


rice cyciophilin 


constitutive 


Buchholz et al, Plant Mol Biol. 25(5): 
837-43, 1994 


maize H3 histone 


constitutive 


Lepettt et al, Mol. Gen. Genet. 
231:276-285, 1992 


acBn2 


constitutive 


An et al, Plant J. 10(1); 107-121, 
1996 



Optionally, one or more terminator sequences may also be used in the construct introduced 
into a plant. The term "terminator" encompasses a control sequence which is a DNA sequence 
at the end of a transcriptional unit which signals 3' processing and polyadenylation of a primary 
5 transcript and termination of transcription. Additional regulatory elements may include 
transcriptional as well as translation^ enhancers. Those skilled in the art will be aware of 
terminator and enhancer sequences which may be suitable for use in the invention. Such 
sequences would be known or may readily be obtained by a person skilled in the art. 

10 The genetic constructs of the invention may further include an origin of replication sequence 
which is required for maintenance and/or replication in a specific cell type. One example is 
when a genetic construct is required to be maintained in a bacterial cell as an episomal genetic 
element (e.g. plasmid or cosmid molecule). Preferred origins of replication include, but are not 
limited to, the fl-ori and colE1 . 

15 

The genetic construct may optionally comprise a selectable marker gene. As used herein, the 
term "selectable marker gene" includes any gene which confers a phenotype on a cell in which 
it is expressed to facilitate the identification and/or selection of cells which are transfected or 
transformed with a genetic construct of the invention. Suitable markers may be selected from 
20 markers that confer antibiotic or herbicide resistance. Cells containing the recombinant DNA 
will thus be able to survive in the presence of antibiotic or herbicide concentrations that kill 
untransformed cells. Examples of selectable marker genes include genes conferring resistance 
to antibiotics (such as nptll encoding neomycin phosphotransferase capable of 
phosphorylating neomycin and kanamycin, or hpt encoding hygromycin phosphotransferase 
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capable of phosphorylating hygromycin), to herbicides (for example bar which provides 
resistance to Basta; aroA or gox providing resistance against glyphosate), or genes that 
provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon 
source). Visual marker genes result In the formation of colour (for example beta-glucuronidase, 
5 GUS), luminescence (such as luciferase) or fluorescence (Green Fluorescent Protein, GFP, 
and derivatives thereof). Further examples of suitable selectable marker genes include the 
ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), bacterial kanamycin resistance 
gene (Kanr), phosphinothricin resistance gene, and the chloramphenicol acetyltransferase 
(CAT) gene, amongst others 

10 

The present invention also encompasses plants obtainable by the methods according to the 
present invention. The present invention therefore provides plants obtainable by the method 
according to the present Invention, which plants have modified growth characteristics, which 
plants have altered 2xC2H2 zinc finger protein level and/or activity and/or altered expression of 
15 a nucleic acid sequence encoding a 2xC2H2 zinc finger protein. 

Therefore, according to one aspect of the present invention, there is provided a method for the 
production of plants, having modified growth characteristics, comprising introducing, into a 
plant, a nucleic acid capable of modifying activity of a 2xC2H2 zinc finger protein and/or 
20 capable of modifying expression of a 2xC2H2 zinc-finger gene. According to a further 
embodiment of the present invention, there is provided a method for the production of 
transgenic plants having modified growth characteristics, comprising introduction and 
expression in a plant of a 2xC2H2 nucleic acid. 

25 More specifically, the present invention provides a method for the production of transgenic 
plants having modified growth characteristics, which method comprises: 
(i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(iii) cultivating the plant cell under conditions promoting plant growth. 

30 The growth characteristic may be any of the characteristics defined hereinunder. 

The 2xC2H2 zinc finger nucleic acid includes all variant nucleic acids as described herein 
before and includes all nucleic acids encoding all variant proteins as described herein before. 
Cultivating the plant cell under conditions promoting plant growth, may or may not include 
35 regeneration and or growth to maturity. 
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The protein itself and/or the nucleic acid itself may be introduced directly into a plant cell or into 
the plant itself (including introduction into a tissue, organ or any other part of the plant). 
According to a preferred feature of the present invention, the nucleic acid is preferably 
introduced into a plant by transformation. . 

5 

The term "transformation" as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may 
be transformed with a genetic construct of the present invention and a whole plant regenerated 

10 therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), 
and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The 

15 polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host 
genome. The resulting transformed plant cell can then be used to regenerate a transformed 
plant in a manner known to persons skilled in the art 

20 Transformation of a plant species is now a fairly routine technique. Advantageously, any of 
several transformation methods may be used to introduce the nucleic acid of interest (e.g. the 
2xC2H2 nucleic acid) into a suitable ancestor cell. Transformation methods include the use of 
liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA 
directly into the plant, particle gun bombardment, transformation using viruses or pollen and 

25 microprojection. Methods may be selected from the calcium/polyethylene glycol method for 
protoplasts (Krens, FA. et al., 1882, Nature 296, 72-74; Negrutiu I. et al., June 1987, Plant 
Mol. Biol. 8, 363-373); electroporation of protoplasts (Shillito R.D. et al., 1985 Bio/Technol 3, 
1099-1102); microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 
179-185); DNA or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70) 

30 infection with (non-integrative) viruses and the like. A preferred transformation method is an 
Argobacterium mediated transformation method. 

Transgenic rice plants expressing a 2xC2H2 gene are preferably produced via Agmbacterium- 
mediated transformation using any of the well-known methods for rice transformation, such as 
35 the ones described in any of the following: published European patent application EP 1198985 
A1, Aldemita and Hodges (Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 
491-506, 1993); Hiei et al. (Plant J. 6 (2) 271-282, 1994); which disclosures are incorporated 
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by reference herein as if fully set forth. In the case of com transformation, the preferred 
method Is as described In either Ishida et al. (Nat Biotechnol. 1996 Jun; 14(6): 745-50) or 
Frame et al. (Plant Physiol. 2002 May; 129(1): 13-22), which disclosures are incorporated by 
reference herein as if fully set forth. 

5 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant 

10 Following DNA transfer and regeneration, putatively transformed plants may be evaluated, for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
DNA may be undertaken using Northern and/or Western analysis, both techniques being well 
known to persons having ordinary skill in the art. 

15 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
propagation or classical breeding techniques. For example, a first generation (or T1) 
transformed plant may be selfed to give homozygous second generation (or T2) transformants, 
and the T2 plants further propagated through classical breeding techniques. 

20 

The generated transformed organisms may take a variety of forms. For example, they may be 
chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). 

25 

The present invention clearly extends to any plant cell or plant produced by any of the methods 
described herein, and to all plant parts and propagules thereof. The present invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 

30 requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) 
as those produced in the parent by the methods according to the invention. The invention also 
includes host ceils having modified expression and/or level and/or activity of a 2xC2H2 zinc 
finger protein. Such host cells for example comprise genetic constructs as mentioned above. 
Preferred host cells according to the invention are derived from a plant, algae, bacterium, 

35 fungus, yeast, Insect or animal. The invention also extends to harvestable parts of a plant such 
as but not limited to seeds, leaves, fruits, flowers, petals, stamen, stem cultures, stem, 
rhizomes, roots, tubers, bulbs or cotton fibers. 
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The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts, Including seeds, shoots, stems, roots (including tubers), and plant cells, 
tissues and organs. The term "plant" also encompasses suspension cultures, embryos, 

5 meristematic regions, callus tissue, leaves, gametophytes, sporophytes, pollen, and 
microspores. Plants that are particularly useful in the methods of the invention include all 
plants which belong to the superfamily Viridiplantae, in particular monocotyledonous and 
dicotyledonous plants including, fodder or forage legumes, ornamental plants, food crop, tree, 
or shrub selected from the list comprising Acacia spp., Acerspp., Actinidia spp.,Aesculus spp., 

10 Agathis austraiis, Afbizia amara, Afsophifa tricolor, Andropogon spp., Arachis spp, Areca 
catechu, Astelia fragmns, Astragalus deer, Baikiaea plurijuga, Betula spp., Brasslca spp., 
Bruguiera gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Cailiandra spp, 
Camellia sinensis, Canna indica, Capsicum spp., Cassia spp., Centroema pubescens, 
Chaenomeles spp.,Cinnamomum cassia, Coffea arabica, Colophospermum mopane, 

15 Coronillia varia, Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., 
Cyathea dealbata, Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea 
dealbata, Cydonia oblonga, Dalbergia monetaria, Davallia divarlcata, Desmodium spp., 
Dicksonia squarosa, Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium 
rectum, Echinochloa pyramidalis, Ehrartla spp., Eleusine coracana, Eragrestis spp., Erythrina 

20 spp., Eucalyptus spp., Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiana, 
Fragaria spp., Flemingia spp, Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine 
javanica, Gliricidia spp, Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, 
Hedysarum spp., Hemarthia altissima, Heteropogon contortus, Hordeum vulgare, Hyparrhenia 
rufa, Hypericum erectum, Hyperthelia dissoluta, Indigo incarnata, Iris spp., Leptarrhena 

25 pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetla simplex, Lotonus 
bainesii, Lotus spp., Macrotyloma axillare, Malus spp., Manlhot esculenta, Medicago sativa, 
Metasequoia glyptostroboides, Musa sapientum, Nicotianum spp., Onobrychis spp., 
Orniihopus spp., Oryza spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, 
Petunia spp., Phaseolus spp., Phoenix canariensis, Phormium cookianum, Photinia spp., 

30 Picea glauca, Pinus spp., Pisum sativum, Podocarpus totara, Pogonarthria Heckii, 
Pogonarthria squarrosa, Populus spp., Prosopis cineraria, Pseudotsuga menziesii, 
Pterolobium stellatum, Pyrus communis, Quercus spp., Rhaphioiepsis umbellate, 
Rhopalostylis sapida, Rhus natalensis, Ribes gmssularia, Ribes spp., Robinia pseudoacacia, 
Rosa spp., Rubus spp., Salix spp., Schyzachyrium sanguineum, Sciadopitys verticillata, 

35 Sequoia sempervirens, Sequoiadendron giganteum, Sorghum bicolor, Spinacia spp., 
Sporobolus fimbriatus, Stiburus alopecuroides, Stylosanthos humilis, Tadehagl spp, Taxodium 
distichum, Themeda triandra, Trifollum spp., Triticum spp., Tsuga heterophylla, Vaccinium 
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spp., Vfc/a spp.Vitls wnffe/a, Watsonla pyramldata, Zantedeschla aethioplca, Zea mays, 
amaranth, artichoke, asparagus, broccoli, brussei sprout, cabbage, canola, carrot, cauliflower, 
celery, collard greens, flax, kale, lentil, oilseed rape, okra, onion, potato, rice, soybean, straw, 
sugarbeet, sugar cane, sunflower, tomato, squash, and tea, trees and algae amongst others. 
5 According to a preferred embodiment of the present invention, the plant is a crop plant such 
soybean, sunflower, canola, alfalfa, rapeseed, cotton, tomato, potato or tobacco. According to 
another preferred embodiment of the present invention, the plant is a monocotyledonous plant, 
such as sugar cane, further preferably a cereal, most preferably the plant is selected from the 
group consisting of rice, maize, wheat, barley, millet, rye or oats. 

10 

In a particular embodiment of the present invention, proteins of one plant species (for example 
Ambidopsis) are introduced in another plant species (for example rice). It has been shown in 
the present invention that plant growth characteristics are improved by introduction of a 
2xC2H2 zinc finger gene or protein from a dicot into a monocot. 

15 

According to a particular embodiment of the invention, there are provided methods as 
described above, wherein the plant is a monocot More preferably the plant is rice or corn. 

Advantageously, performance of the methods according to the present invention leads to 
20 plants having modified growth characteristics. 

The term "growth characteristic" as used herein, preferably refers to anyone or more of, but is 
not limited to, yield, architecture and cycle time. 

The term "yield" means the amount of harvested material. For crop plants yield also means the 
25 amount of harvested material per acre of production. Depending on the crop the harvested part 
of the plant may be a different part or tissue of the plant, such as seed (e.g. rice, sorghum or 
com when grown for seed); total above-ground biomass (e.g. for com, when used as silage), 
root (e.g. sugarbeet), fruit (e.g. tomato), cotton fibers, or any other part of the plant which is of 
economic value. "Yield* also encompasses yield stability of the plants, meaning that year after 
30 year, one can obtain the same yield from the progeny of the plants, without too much 
interference of external factors, such as weather conditions. "Yield" also encompasses yield 
potential, which as the maximum obtainable yield. 

Yield maybe dependent on a number of yield components. The parameters for these 
components are well known by a person skilled in the art. For example breeders are well 
35 aware of the specific yield components and the corresponding parameters for the crop they are 
aiming to improve. 
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For example key yield components for corn include number of plants per hectare or acre, 
number of ears per plant, number of rows (of seeds) per ear, number of kernels per row, and 
thousand kernel weight. For silage com typical parameters are the above ground biomass and 
energy content. 

5 Key yield components for rice include number of plants per hectare or acre, number of 
panicles per plant, number of spiketets per panicle, seed filling rate (number of filled seeds) 
and thousand kernel weight. Preferentially methods for increasing yield of rice encompass 
increased number of flowers per panicle and an increased number of filled seeds. The 
parameter of increased total number of seeds may be linked to increased number of flowers. 

10 "Yield* further encompasses typical biomass components, such as above ground parts of a 
plant and the root system. General biomass parameters are area and dry weight. Specific 
parameters for above ground biomass further encompass above ground area and plant height 
Specific parameters for the root system encompass root ratio, root length and penetration 
depth, root branching, root hair density, root pulling resistance and aerenchyma formation. 

15 

The plants of the present invention are characterized by increased number of filled seeds, 
increased total seed weight, increased total number of seeds and increased harvest index. 
Therefore the methods of the present invention are particularly favorable to be applied in 
cereals such as rice and corn (maize). Accordingly, a particular embodiment of the present 
20 invention relates to a method to increase yield of com, comprising modifying expression of a 
nucleic acid encoding a 2xC2H2 zinc finger protein. 

The plants of the present invention are characterized by an increase in thousand kernel weight 
and therefore the seed size or seed volume and/or the seed content and/or seed composition 
25 are altered by the methods of the present invention. The seeds provided by the methods of the 
present invention may have more nutritional value, more starch and/or more oil, possibly due 
to their bigger size. 

The plants of the present invention are characterized by more above ground area. Therefore, 
30 the methods of the present invention are particularly favorable for crops grown for their green 
tissue and/or grown for their above ground biomass. The methods of the present invention are 
particularly useful for grasses, forage crops (such as forage com (maize), clover, medicago 
etc.), trees, sugar cane etc. 

35 The improvement in yield as obtained by the methods of the invention, may be obtained as a 
result of improvement of one or more of the above mentioned yield components and/or 
parameters. 
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The term "architecture" as used herein encompasses the appearance or morphology of a 
plant, including any one or more structural features or combination of structural features 
thereof. Such structural features include the shape, size, number, position, texture, 

5 arrangement, and pattern of any cell, tissue or organ or groups of cells, tissues or organs of a 
plant, including the root, leaf, shoot, stem, petiole, trichome, flower, petal, stigma, style, 
stamen, pollen, ovule, seed, embryo, endosperm, seed coat, aleurone, fibre, cambium, wood, 
heartwood, parenchyma, aerenchyma, sieve element, phloem or vascular tissue, amongst 
others. Particular architectural characteristics that may be modified by the methods of the 

10 present invention are increased plant height, increased number or size of stems or stalks or 
tillers or panicles or pedicles, increased number or size of inflorescences, increased branching 
of for example of tassels and ears or altered flowering characteristics. A preferred architectural 
characteristic that may be modified by the methods of the present invention is leaf architecture. 
The term "leaf architecture" as used herein comprises typical leaf characteristics such as 

15 length, width, thickness, cell number, cell size and greenness. 

Typically, the plants of the present invention display increased leaf surface area and /or 
increased leaf blade width. This trait is particularly important as it allows the plant to optimize 
the shape of its leaf to maximize the area used for photosynthesis. For that purpose, preferably 
20 the leaf blade is widened, but alternatively, the leaves are longer or smaller or rounder. These 
effects may lead to more healthy plants. Alternatively, this trait attributes aesthetic properties to 
the plant such as greenness and stronger leafs. 

"cycle time" of the plant as used herein means the time wherein a plant reaches 90% of its 
25 maximum total area. This parameter is an indication of the duration of the vegetative growth. 
Prolonged vegetative growth was only displayed in some of the plants according to the present 
invention and may be controlled by choice of the transformation event and/or by choice of the 
promoter driving the 2xC2H2 nucleic acid. For example this characteristic was not displayed 
when a seed-preferred promoter was used. 

30 

Other "growth characteristics" that may be improved by the methods of the present invention 
are growth rate, early vigour, modified Tmid, T90 or A42 or altered growth curve. 

It is clear from the data as presented in the examples that one or more of the growth 
35 characteristics as defined herein above, may be combined in one plant Alternatively, 
depending on the chosen transformation event and/or depending on the promoter used, one 
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or more of these growth characteristics may be present or absent or more or less pronounced 
in the plant. 

The methods of the present invention may also be used to confer stress tolerance to plants. In 
5 particular, a 2xC2H2 of the STZ type may be used to confer to a plant salt stress tolerance 
and/or drought stress tolerance. According to a specirftc embodiment, a tissue preferred 
promoter, such as a seed-preferred promoter* is used in these methods. 

The present invention also relates to use of a nucleic acid sequence encoding a zinc finger 
10 protein and homologues, derivatives and active fragments thereof in modifying the growth 
characteristics of plants, preferably in increasing yield, further preferably increasing seed yield. 
The present invention also relates to use of a nucleic acid sequence encoding a 2xC2H2 zinc 
finger protein and homologues, derivatives and active fragments thereof and to the 2xC2H2 
zinc finger protein itself and to homologues, derivatives and active fragments thereof as a 
15 growth regulator. The sequences represented by SEQ ID NO 1, and portions thereof and SEQ 
ID NO 2, and homologues, derivatives and active fragments thereof are useful in modifying the 
growth characteristics of plants, as hereinbefore described. The sequences would therefore 
find use as growth regulators, such as herbicides or growth stimulators. The present invention 
also provides a composition comprising a protein represented by SEQ ID NO 2, or a 
20 homologue, derivative or active fragment thereof for the use as a growth regulator. A growth 
regulator is used herein as meaning a regulator that increased yield and is therefore also 
referred to as yield regulator. 

In particular, the present invention provides a yield regulating composition comprising a nucleic 
acid encoding a 2xC2H2 protein, and/ or comprising a 2xC2H2 protein, and/or comprising a 
25 construct as defined herein above. Such a yield regulating composition further comprises 
additives normally use in yield regulating compositions, such as a solvent or carrier. 

Conversely, the sequences according to the present invention may also be interesting targets 
for agrochemical compounds, such as herbicides or growth stimulators. Accordingly, the 
30 present invention encompasses use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 
protein and/or of a construct as defined in any of claims 20 to 22 as target for an agrochemical , 
such as a herbicide or a growth stimulator. 

The methods according to the present invention may also be practised by co-expression of a 
gene encoding a 2xC2H2 zinc finger protein in a plant with at least one other gene that 
35 cooperates with the gene encoding a 2xC2H2 zinc finger protein. Such a gene may be a gene 
encoding a target protein of the 2xC2H2 zinc finger protein. Co-expression may be effected by 
cloning the genes under the control of a plant expressible promoter in a plant expressible 
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vector and introducing the expression vector(s) into a plant cell using >4groJbacferfi//n-mediated 
plant transformation. Therefore, the methods according to the present invention may result in 
plants having modified growth characteristics, particularly increased yield, as described 
hereinbefore in combination with other economically advantageous traits, such as further yield- 
5 enhancing traits, tolerance to various stresses, traits modifying various architectural features 
and/or biochemical and/or physiological features. 

Since the plants of the present invention have excellent growth characteristics and have high 
yield, they are suitable for the production of enzymes, pharmaceuticals or agrochemicals. Also, 
1 0 there are suitable to produce food or feed products. 

The invention dearly extends to enzymes, pharmaceuticals or agrochemicals as well as food 
or feed products isolated from these plants. 

Further a nucleic acid encoding a 2xC2H2 protein, a 2xC2H2 protein and/or the constructs of 
the present invention may be used breeding programs aiming at the development of plants 
15 with increased yield. 

Particularly, the use of allelic variants as defined above in particular conventional breeding 
programmes, such as in marker-assisted breeding is also encompassed by the present 
invention; this may be in addition to their use in the methods according to the present 

20 invention. Such breeding programmes sometimes require the introduction of allelic variations 
in the plants by mutagenic treatment of a plant One suitable mutagenic method is EMS 
mutagenesis. Identification of allelic variants then takes place by, for example, PCR. This is 
followed by a selection step for selection of superior allelic variants of the sequence in question 
and which give rise to altered growth characteristics in a plant. Selection is typically carried out 

25 by monitoring growth performance of plants containing different allelic variants of the sequence 
in question, for example, SEQ ID NO 1. Monitoring growth performance may be done in a 
greenhouse or In the field. Further optional steps include crossing plants in which the superior 
allelic variant was identified with another plant. This could be used, for example, to make a 
combination of interesting phenotypic features 

30 

According to another type of breeding programme a DNA marker is identified which may be 
genetically linked to a gene capable of modifying expression of a nucleic acid encoding a 
2xC2H2 zinc finger protein in a plant, which gene may be a gene encoding the 2xC2H2 zinc 
finger protein itself or any other gene which may directly or indirectly influence expression of 
35 the gene encoding a 2xC2H2 zinc finger protein and/or activity of the 2xC2H2 zinc finger 
protein itself. This DNA marker may then be used in breeding programs to select plants having 
altered growth characteristics. 
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The methods according to the present invention may also be practised by introducing into a 
plant at least a part of a (natural or artificial) chromosome (such as a Bacterial Artificial 
Chromosome (BAG)), which chromosome contains at least a gene encoding a 2xC2H2 zinc 
5 finger protein, optionally together with one or more related gene family members. Therefore, 
according to a further aspect of the present invention, there is provided a method for modifying 
growth characteristics of plants by expressing in a plant at least a part of a chromosome 
comprising at least a gene encoding a 2xC2H2 zinc finger protein. 

10 The present invention will now be described with reference to the following figures In which: 

Fig. 1 is a map of an expression vector for the expression in plants of a 2xC2H2zinc finger 
protein under the control of a GOS2 promoter. CDS1536 is the internal code for the 
Arabidopsis thaliana salt tolerant zinc finger (STZ) protein cDNA. The zinc finger protein 

IS expression cassette has a GOS2 promoter and a double terminator sequence (T-zein and T- 
rbcS-deltaGA) located within the left border (LB repeat) and the right border (RB repeat) of the 
71 plasmid. Cloned within these T-borders are also a screenable marker and a selectable 
marker, each under the control of a constitutive promoter (Prom), followed by a terminator 
sequence (poly a and t-NOS). Furthermore, this vector also contains an origin of replication 

20 (pBR322 (ori + bom)) for bacterial replication and a selectable marker (Sm/SpR) for bacterial 
selection. 

Fig. 2A shows digital images from a T1 positive line transformed with an STZ zinc finger 
transgene under control of a GOS2 promoter and Fig. 2B shows digital images of 
25 corresponding nuilizygotes plants. 

Fig. 3 lists sequences useful in the methods of the present invention. SEQ ID NO 1 is an STZ 
encoding nucleic acid isolated from Arabidopsis thaliana] the start and the stop codon are 
highlighted in bold. SEQ ID NO 2 is the STZ protein sequence encoded by SEQ ID NO 1. In 

30 the STZ protein the nuclear localization signal also called the KRS motif or B-box is annotated 
(bold, italics, underlined), as well as the L-box (bold, underlined), the EAR motif (bold, italics), 
and the two C2H2 zinc finger domains with QALGGH motif (bold and boxed). SEQ ID NO 10 
to SEQ ID NO 25 provides the sequences of various orthologs of the Arabidopsis thaliana STZ 
protein from other plant species. SEQ ID NO 26 to SEQ ID NO 35 provides the sequences of 

35 various paralogs (from Arabidopsis) of the STZ protein. SEQ ID NO 36 to SEQ ID NO 50 
provides the sequences of related 2xC2H2 genes and proteins useful in the methods of the 
present invention. 
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Fig. 4 is a photograph of T3 plants grown in a greenhouse (A) or in a field (B). The photograph 
shows yield increase ( especially in aboveground biomass and plant height) in subsequent 
generations of STZ transformed plants. 

5 

Fig. 5 shows the binary vector for expression in Oryza sativa of the Arabidopsis thaliana STZ 
gene (CDS1536) under the control of a seed preferred WSI18 promoter (PRO0151). This 
vector contains a T-DNA derived from the Ti Plasmid, limited by a left border (LB repeat, LB Ti 
C58) and a right border (RB repeat, RB Ti C58)). 

10 The zinc finger protein expression cassette has a WSI18 (PRO0151) promoter and a double 
terminator sequence (T-zein and T-rbcS-deltaGA) located within the left bonder (LB repeat) 
and the right border (RB repeat) of the Ti plasmid. Cloned within these T-borders are also a 
screenable marker and a selectable marker, each under the control of a constitutive promoter 
(Prom), followed by a terminator sequence (poly a and t-NOS). Furthermore, this vector also 

15 contains an origin of replication (pBR322 (ori + bom)) for bacterial replication and a selectable 
marker (Sm/SpR) for bacterial selection. 

Examples 

The present invention will now be described with reference to the following examples, which 
20 are by way of illustration alone. 

DNA Manipulation 

Unless otherwise stated, recombinant DNA techniques are performed according to standard 
protocols described in Sambrook (2001) Molecular Cloning: a laboratory manual, 3rd Edition 
25 Cold Spring Harbor Laboratory Press, CSH, New York or In Volumes 1 and 2 of Ausubel et a/. 
(1988), Current Protocols in Molecular Biology, Current Protocols. Standard materials and 
methods for plant molecular work are described in Plant Molecular Biology Labfese (1993) by 
R.D.D. Croy, published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific 
Publications (UK). 

30 

Example 1: Gene Cloning 

A gene encoding an STZ protein was amplified by PGR from an Arabidopsis thaliana seedling 
cDIMA library (Invitrogen, Paisley, UK). After reverse transcription of RNA extracted from 
seedlings, the cDNAs were cloned into pCMV Sport 6.0. Average insert size of the bank was 
35 1.5 kb, and original number of clones was of 1.59x10 7 cfu. Original titer was determined to be 
9.6x10 5 cfo/ml, after first amplification of 6x10 11 cfu/ml. After plasmid extraction, 200 ng of 
template was used in a 50\i\ PCR mix. Sequences of the primers used for PCR amplification 
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were, including the attB sites for Gateway recombination (in bold) were PRM3204 (sense, start 
codon in italics) 5' GGGG ACAAGTTTGTACAAAAAAGC AGGCTTCACA/\ TGGCG 
CTCGAGGCTC 3' (SEQ ID NO 3) and PRM3205 (reverse, complementary stop codon in 
italics) 5' GGGGACCACTTTGTACAAGAAAGCTGGGTAATTTCC7T4AAGTTGAAGT7TGA 
5 3' (SEQ ID NO 4). 

PCR was performed using Hifi Taq DNA polymerase in standard conditions. The PCR 
fragment (CDS1536) was amplified and purified using standard methods. The first step of the 
Gateway procedure, the BP reaction, was then performed, during which the PCR fragment 
10 was recombined in vivo with the pDONR plasmid to produce, according to Gateway 
terminology, an "entry clone", p3359. PDONR was purchased from Invitrogen, as part of the 
Gateway technology. 

Example 2: Vector construction for rice transformation with pGOS2::A*STZ 
15 The entry done p3359 was subsequently used in an LR reaction with p0640, a destination 
vector used for rice transformation. This vector contains as functional elements within the T- 
DNA borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. Upstream of 
this Gateway cassette lies the rice GOS2 promoter for constitutive expression of the zinc finger 
20 gene (De Pater et a/., Plant J. 2 (6) 837-844, 1992). After the recombination step, the resulting 
expression vector with the expression cassette CD4398 (Figure 1) was transformed into 
Agrobacterium strain LBA4404 and subsequently into plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in Example 3. 

25 Example 3: Evaluation of TO, T1 and T2 transgenic rice plants transformed with 
pGOS2::AfSTZ (CD4398) 

Approximately 15 to 20 independent TO transformants were generated. The primary TO 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Six events of which the T1 progeny segregated 3:1 for presence/absence 

30 of the transgene were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by PCR. Based on the results of the T1 
evaluation three events were chosen, for further characterisation in the T2 generation, one 
event being very positive for a number of parameters, a second event being positive for a 

35 number of parameters, but less pronounced, and a third event being neutral. Seed batches 
from the positive plants (both hetero- and homozygotes) in T1, were screened by monitoring 
marker expression. For each chosen event, the heterozygote seed batches were then selected 
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for T2 evaluation. An equal number of positives and negatives within each seed batch were 
transplanted for evaluation in the greenhouse (i.e., for each event 40 plants were grown of 
which there were about 20 positives for the transgene and about 20 negative). Therefore, the 
total number for the three events amounted to 120 plants for evaluation in the T2 generation. 

5 

T1 and 72 plants were transferred to the greenhouse and evaluated for vegetative growth 
parameters and seed parameters, as described hereunder. 

(h Statistical analysis of phenotvplc characteristics 

10 A two factor ANOVA (analyses of variance) was used as statistical model for the overall 
evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters 
measured, for all the plants of all the events transformed with the gene of interest The F-test 
was carried out to check for an effect of the gene over all the transformation events and to 
verify an overall effect of the gene or "global gene effect". Significant data, as determined by 

15 the value of the f-test, indicates a "gene 0 effect, meaning that the phenotype observed is 
caused by more than the presence or position of the gene. In case of the F-test, the threshold 
for significance for a global gene effect is set at 5% probability level. 

To check for an effect of the genes within an event, i.e., for a line-specific effect, a t-test was 
20 performed within each event using data sets from the transgenic plants and the corresponding 
null plants. "Null plants" or "Null segregants"are the plants treated in the same way as the 
transgenic plant, but from which the transgene has segregated. Null plants can also be 
described as homozygous negative transformant plants. The threshold for significance for the 
t-test is set at 10% probability level. Within one population of transformation events, some 
25 events may be under or above this t-test threshold. This is based on the hypothesis that a 
gene might only have an effect in certain positions in the genome, and that the occurrence of 
this position-dependent effect is not uncommon. This kind of gene effect may also be referred 
to as a "line effect of a gene*. The p- value is obtained by comparing the t-value to the t- 
distribution or alternatively, by comparing the F-value to the f-distribution. The p- value stand 
30 for the probability that the null hypothesis (null hypothesis being "there is no effect of the 
transgene") is correct. 

(II) Vegetative growth measurements 

The selected plants were grown in a greenhouse. Each plant received a unique barcode label 
35 to link unambiguously the phenotyping data to the corresponding plant. The selected plants 
were grown on soil in 10 cm diameter pots under the following environmental settings: 
photoperiod- 11.5 h, daylight intensity 30,000 lux or more, daytime temperature^ 28°C or 
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higher, night time temperature^ 22°C, relative humidity= 60-70%. Transgenic plants and the 
corresponding nullizygotes were grown side-by-side at random positions. From the stage of 
sowing until the stage of maturity (which is the stage were there is no more increase in 
biomass) the plants were passed weekly through a digital imaging cabinet (examples of 
5 pictures are shown in Figures 2A and 2B). At each time point digital images (2048x1536 pixels, 
16 million colours) were taken of each plant from at least 6 different angles. The parameters 
described below were derived in an automated way from the digital images using image 
analysis software. 

10 (a) Aboveground area 

Plant above ground area was determined by counting the total number of pixels from 
aboveground plant parts discriminated from the background. This value was averaged for the 
pictures taken on the same time point from the different angles and was converted to a 
physical surface value expressed in square mm by calibration. Experiments show that the 

15 aboveground plant area measured this way correlates with the biomass of plant parts above 
ground. 

Results of the maximum above ground area values of the lines selected for T2 evaluation are 
summarized in Table 1. The plants of the best performing line showed an increase in biomass 
20 of 34 % , compared to the nullizygotes. 

When an F-test was carried out on all the plants of all the T2 events it became clear that the 
transgenic plants show a significant increase in above ground area, in average an increase of 
approximately 18%. A significant increase in above ground biomass is also displayed by STZ 
transformed plants grown under field conditions (see figure 4). 

25 

Table 1 : Aboveground area of STZ transgenic T2 plants. Each row con-esponds to one event, 
for which the average maximum aboveground area (expressed in mm 2 ) was determined for the 
transgenics (TR) and the null plants (null). The difference in absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif.) as well as the 
30 percentage of difference between the two populations (% dif). P stands for the probability 
produced by the Wesf for each event. The last row presents the average numbers calculated 
from all the events. Here the p-value is produced by the F-test. 
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Total above ground Area Max (mm*) 


Line 


TR 


null 


dlf 


% dlf 


p-value 


CD4396 LI 


63947 


47606 


16341 


34 


0.0021 


CD4396 L2 


42509 


41342 


1167 


3 


0.8063 


CD4396 L3 


41116 


33687 


7429 


22 


0.1107 


Overall 


49178 


41657 


7522 


18 


0.0047 



(b) Plant height measurements 

Plant height was determined by the distance between the horizontal lines going through the 
5 upper pot edge and the uppermost pixel corresponding to a plant part above ground. This 
value was averaged for the pictures taken on the same time point from the different angles and 
was converted, by calibration, to a physical distance expressed in mm. Experiments showed 
that plant height measured this way correlate with plant height measured manually with a ruler. 

10 The increase in plant height was displayed very clearly in STZ transformed plants when 
measured at the end of the vegetative growth (see figure 4A). Also, this parameter, was 
displayed by STZ transformed plants when grown in the field (see figure 4B) at the time of 
harvest. 

1 5 (c) Total area cycle time measurements 

Plants were imaged weekly along the complete cell cycle and the maximum total area of the 
plants was determined as mentioned above. Total Area Cycle Time is the time when a plant 
reaches 90% of its maximum total area. This parameter is an indication of the duration of the 
vegetative growth. 

20 Only in some transgenic lines there was an effect on cycle time. These few lines showed a 
prolonged vegetative growth. 

(lift Measurement of seed-related parameters 

The mature primary panicles were harvested, bagged, barcode-labelled and then dried for 
25 three days in the oven at 37 P C. The panicles were then threshed and all the seeds collected. 
The filled husks were separated from the empty ones using an air-blowing device. After 
separation, both seed lots were then counted using a commercially available counting 
machine. The empty husks were discarded. The filled husks were weighed on an analytical 
balance and the cross-sectional area of the seeds was measured using digital imaging. This 
30 procedure resulted in the set of seed-related parameters described below. 
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(a) Total number of filled seeds per plant 

The number of filled seeds was determined by counting the number of filled husks that 
remained after the separation step. 

5 Total numbers of filled seeds per plant are summarized in Table 2. The t-test shows that 
for two events, transgenic plants produce 106% and 130% more filled seeds than the 
nulllzygotes. 

Table 2: Number of filled seeds of STZ transgenic T2 plants. Each row corresponds to one 
10 event, for which the average number of filled seeds was determined for the transgenics (TR) 
and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event The last row presents the average numbers calculated from all the events. Here 
15 the p-value is produced by the FJest. 



Number of filled seeds 


Line 


TR 


null 


dlf 


% dlf 


p-value 


CD4396 LI 


387.9 


188.7 


199.19 


106 


<0.0001 


CD4396 L2 


163.8 


156.5 


7.22 


5 


0.8382 


CD4396 L3 


236.9 


102.9 


133.98 


130 


0.0004 


Overall 


264.9 


159.7 


105.25 


66 


<0.0001 



(b) Total seed weight per plant 

The total seed weight was measured by weighing all filled husks harvested from a plant 

20 The total seed weight values of STZ transformed plants are summarized in Table 3. STZ 
transgenic plants produce significantly more seed weight than the corresponding nullizygotes. 
The difference in seed weight of the transgenics may be as high as 138% or higher. 

Table 3: Total seed weight per plant of STZ transgenic T2 plants. Each row corresponds to 
25 one event, for which the average total seed weigh (in gram) was determined for the 
transgenics (TR) and the null p/anfe (null). The difference in absolute values between the 
transgenic population and the nullizygotes of each event are presented (dif.) as well as the 
percentage of difference between the two populations (% dif). P stands for the probability 
produced by the t-test for each event The last row presents the average numbers calculated 
30 from all the events. Here the p-value is produced by the F-test. 
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Total weight of seeds 


Line 


TR 


null 


dlf 


%dlf 


p -value 


CD4396 LI 


9.8 


4.5 


5.25 


116 


<0.0001 


CD4396 L2 


3.4 


3.3 


0.1 


3 


0.908 


CD4396 L3 


6.1 


2.6 


3.56 


138 


0.0001 


Overall 


6.5 


3.7 


2.75 


74 


<0.0001 



(c) Harvest Index 

The harvest index in the present invention is defined as the ratio between the total seed yield 
and the above ground area (mm 2 ), multiplied by a factor 10 6 . 

5 

The harvest index values of the STZ-transgenic plants are summarized in Table 4. STZ 
transgenic plants have a significant increase in harvest index. The increase in harvest index of 
the transgenic plants may be as high as 66%, when compared to the corresponding 
nullizygotes. 

10 

Table 4: Harvest index of STZ transgenic T2 plants. Each row corresponds to one event, for 
which the average harvest index was determined for the transgenics (TR) and the nuli piants 
(null). The difference in absolute values between the transgenic population and the 
nullizygotes of each event are presented (dif.) as well as the percentage of difference between 
15 the two populations (% dif)- P stands for the probability produced by the t-test for each event 
The last row presents the average numbers calculated from all the events. Here the p-value is 
produced by the F-test. 



Harvest index 


Une 


TR 


null 


dlf 


%dif 


p-value 


CD4396 LI 


149.1 


90 


59.11 


66 


<0.0001 


CD4396 L2 


74 


73.4 


0.55 


1 


0.9574 


CD4396 13 


121.3 


75.9 


45.32 


60 


<0.0001 


Overall 


114.8 


82.6 


32.16 


39 


<0.0001 



(d) Thousand kernel welgth (TKW) of plants 
20 Thousand Kernel Weight (TKW) is a parameter extrapolated from the number of filled seeds 
counted, and their total weight. 

The weight values of thousand kernels of STZ transgenic plants are presented in Table 5. STZ 
transgenic plants have increased thousand kernel weight. The increase of TKW of transgenic 
plants may be as high as 6% when compared to the corresponding nullizygotes. 
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Table 5: Thousand kernel weight ofSTZ transgenic 72 plants. Each row corresponds to one 
event, for which the average TKW was determined for the transgenics (TR) and the null plants 
(null). The difference in absolute values between the transgenic population and the 
5 nullizygotes of each event are presented (dif.) as well as the percentage of difference between 
the two populations (% dif)- P stands for the probability produced by the t-test for each event 
The last row presents the average numbers calculated from all the events. Here the p-value is 
produced by the F-test 



TKW 


Une 


TR 


null 


dif 


%dlf 


p-value 


CD4396U 


25.2 


23.8 


1.46 


6 


0.0128 


CD4396 U 


20.6 


20.7 


-0.14 


-1 


0.7963 


CD4396 13 


25.5 


24.5 


0.99 


4 


0.0812 


Overall 


23.7 


23 


0.71 


3 


0.0213 



10 (e) Total number of seeds 

The total number of seeds per plant was measured by counting the number of husks harvested 
from a plant. 

The total numbers of seeds per plant are summarized in Table 6. STZ transformed plants have 
15 an increase in total number of seeds. The increase of total number of seeds may be as high as 
68%, when compared to the corresponding nullizygotes. 

Table 6: Total number of seeds of STZ transgenic T2 plants. Each row corresponds to one 
event, for which the average total number of seeds was determined for the transgenics (TR) 
20 and the null plants (null). The difference in absolute values between the transgenic population 
and the nullizygotes of each event are presented (dif.) as well as the percentage of difference 
between the two populations (% dif). P stands for the probability produced by the t-test for 
each event The last row presents the average numbers calculated from all the events. Here 
the p-value is produced by the F-test. 



Total number of seeds 


Une 


TR 


null 


dif 


% d if 


p-value 


CD4396 LI 


483.5 


367.4 


116.03 ! 


32 


0.0146 


CD4396 L2 


353.9 


327.5 


26.42 


8 


0.5473 


CD4396 L3 


383.6 


228.2 


155.48 


68 


0.0009 


Overall 


406 


312.5 


93.52 


30 


0.0002 
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Conclusion 

It may be conducted that vegetative growth Is increased in the STZ transgenic plants when 
compared to the control non -transgenic plants, as reflected by parameters such as above 

5 ground area, where the increase is above 20 %. This effect may be attributed to the 
expression of the STZ gene in the transgenic plants. Additionally, in some transformation 
events, the length of the vegetative growth is altered in the STZ transgenic plants. For those 
transformation events in which this effect occurs, in average the vegetative growth was 
prolonged with about 4 to 6 days, under the conditions tested. 

10 Furthermore, yield was increased in STZ transgenic plants. Several seed parameters reflect 
this yield increase. The total number of seeds harvested was at least 100% higher in the 
transgenics than in the control plants, for those events showing a differential. For these 
events, there was also an increase in the total number of seeds of the transgenics, which 
increase was higher than 30 %. Seed filling in those transgenics was greatly improved, 

1 5 reaching differences above 1 00% in the number of filled seeds. 

Seed of the transgenic plants were also heavier, and probably bigger, as suggested by the 
higher values obtained for the thousand kernel weigh. The TKW parameter is a very stable 
parameter in rice cultivars, such as nipponbare, and in the growth conditions here used. This 
means that this parameter is not easily influenced and makes it an important yield parameter. 

20 Therefore a TKW increase of 6 % represents a significantly increase in yield. 

Harvest index, another important yield parameter, was increased in the transgenic plants with 
more than 50 %. 

Jn summary, based on the evaluation of STZ transgenic plants in the T1, T2 and further 
generations, it may be concluded that the presence of an STZ transgene, has a positive effect 
25 on the size of the plant and/or its organs, as well as a positive effect on the final yield 
harvested. 

(Ill) Root growth measurements 

Transgenic plants are grown next to their corresponding non-transgenic null segregant in 
transparent pots. In average, for each construct comprising a particular promoter-2xC2H2 
30 combination, a minimum of 5 independent transformation events are evaluated for root growth, 
root development and root architecture. Typically, per transformation event, 10 transgenics are 
compared to 10 nullizygotes. Root pictures are taken weekly during plant growth. The pictures 
are processed and analyzed to extract the values for the root parametes as detailed below. 
Statistical analysis as described above are applied to these data. 

35 
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a) Root Area 

Total root area Is calculated from the summed number of pixels of each root images. A positive 
linear correlation between root area and dry weight and root biomass of the root has been 
previously established by similar experiments. Therefore, root area Is a good approximation for 
5 root biomass. 

b) Root Length 

The total perimeter of the roots of a plant is calculated as the sum of the perimeter of all roots 
in the images. A linear correlation between this measurement and root length has been 
10 previously established. Thus, root length is extrapolated from the total root perimeter. 

c) Root Width 

Average root width of a plant is expressed as the ratio between the Root Area and the Root 
Length. 

15 

STZ transgenic plants of the invention show a superior performance when compared to control 
pants. Transgenic plants are altered in one or more the root parameters detailed above. In 
particular the transgenic have increased root biomass, for example due to increased root dry 
weight or area, and/or increased root length and/or increased root width. 

20 

Example 4: Leaf Blade Width Measurement 

Leaves of STZ transgenic plants appeared bigger and wider when compared to the 
corresponding control non-transgenic plants. To quantify the increase in leaf width, leaf blade 
width (length of transversal axe) of the flag leaf was measured with a ruler at the widest point 
25 of the leaf, which is approximately at half of the length, in plants that have reached the end of 
the vegetative growth phase. The results shown in the Table 7, indicate that the increase in 
the leaf blade width in at least the event here measured was around 15 % when compared to 
the corresponding nullizygote. 

30 Table 7: Leaf blade width of STZ transgenic T2 plants. The average leaf blade width was 
determined for the transgenics (TR) and the null plants (null) of the selected event The 
difference in absolute values between the transgenic population and the nullizygotes of the 
event is presented (dif.) as well as the percentage of difference between the two populations 
(% dif). P stands for the probability produced by the t-test . 

35 
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Leaf blade width 


Une 


TR 


null 


dlf 


% dif 


p-value 


CD4396 LI 


1.56 


1.35 


0.21 


15 


0.098 



Example 5: Vector construction for rice transformation with pWSI18::AtSTZ 

Vector construction for transformation with the pWSI18 (PRO0151) - AtSTZ (CDS1536) 
cassette was carried out essentially as In example 2. The entry clone p3359, described earlier, 
was subsequently used in an LR reaction with p05653, a destination vector used for rice 
transformation. This destination vector contains as functional elements within the T-DNA 
borders a plant selectable marker and a Gateway cassette intended for LR in vivo 
recombination with the sequence of interest already cloned in the donor vector. A WSI18 
promoter for seed preferred expression (PRO0151) is located upstream of this Gateway 
cassette. After the recombination step, the resulting expression vector with the expression 
cassette CD4398 (Figure 5) was transformed into Agrobactehum strain LBA4404 and 
subsequently this vector was transformed to Oryza saiiva plants. Transformed rice plants were 
allowed to grow and then examined for various parameters as described in example 3. 

Example 6: Evaluation of TO and T1 transgenic rice plants transformed with the seed 
preferred expression cassette pWS1 18:: AtSTZ (CD4398) 

Preparations of calli and of the Agrobactehum tumefaciens strain containing the expression 
vector with the CD4398 expression cassette, were carried out as described in example 3, as 
were the calli transformation and plant regeneration. 

Approximately 15 to 20 independent TO rice transformants were generated. The primary 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. Events, of which the T1 progeny segregated 3:1 for presence/absence of 
the transgene, were retained. For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 
lacking the transgene (nullizygotes), were selected by monitoring marker expression. 
Transgenic plantlets were grown next to control nullyzygotes, seeds were harvested and 
thousand kernel weight determined as previously described. 

Transformed plants comprising the expression cassette CD8490 (seed preferred 
pWSI18::STZ), had a normal and healthy appearance and were harvested at the same time as 
the control plants. The seeds harvested from the transgenic plants had an increase in 
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thousand kernel weight when compared to the control plants. As shown in Table 8 increase in 
thousand kernel weight was above 10%. 

Table 8: Thousand kernel weight ofSTZ transgenic T1 plants. The average I thousand kernel 
5 weight was determined for the transgenics (TR) and the null plants (null) of the selected event 
The difference in absolute values between the transgenic population and the nullizygotes of 
the event is presented (dif.) as well as the percentage of difference between the two 
populations (% dif). P stands for the probability produced by the t-test . 



Thousand kernel weight 


Une 


TR 


null 


dif 


% dif 


p-value 


CD8490 U 


29.6 


26.8 


2.82 


11 


0.001 



10 

Example 7: Cloning, transformation and evaluation of other 2xC2H2 encoding genes. 

In Table 9 an overview is given of constructs with STZ or other 2xC2H2 zinc finger proteins, 
under control of various promoters, which constructs are made for use in the methods of the 
present invention. The coding regions of the 2xC2H2 genes to be cloned (GOI, Gene of 

15 Interest) are amplified by PCR from cDNA , following the protocol as in Example 1. Specific 
primers for each 2xC2H2 gene were designed at the start and stop codons of the gene 
sequence as present in the public database under the accession number as indicated in Table 
9. These cloned sequences are also herein incorporated under the SEQ ID NO number as 
mentioned in the table. Moreover, the isolated PCR fragments were also given a unique CDS 

20 number. 

The PCR fragment with a 2xC2H2 gene is then cloned under the control of a particular 
promoter. Different combinations for different genes are made (see Table 9). Chimeric 
constructs are made and CD numbers represent bacterial strains carrying the chimeric 
construct. Corresponding transgenic plants are obtained by transforming the plants with the 
25 chimeric constructs, following the protocols as mentioned herein before. Evaluation of the 
transgenic events reveals an increase in yield, and increase in leaf surface area and/or an 
increase in duration of vegetative growth in the transgenic plants when compared to the control 
non-transgenic plants. 

30 
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CD-070-PCT 



Table 9: examples of 2xC2H2 chimeric constructs useful for the methods of the present invention. *see Tz 



CDS 


Accession number 
cDNA on which primers 
were designed to amplify 
the CDS region) 


Prot ACC number 


SEQ ID NO 


PRO0129* 


PRO0170* 


PRO0081_2* 


PRO0123* 


PRO0207* 


PRO011I 


CDS1536 
STZ 

Arabldopsfs 


X95573 


CAA64820 


1+2 


CD4398 


CD11371 


CD11382 


CD10960 


CD10959 


CD10315 


CDS2200 

Paralog 

Arabldopsls 


AF022658NMJ20516 


AAB80922.1At5g04340 


28 + 29 


CD11576 






CD11413 




CD1154( 


CDS2205 

Paralog 

Arabldopsfs 


NMJ23683 


At5g43170 


32 + 33 


CD11325 






CD11414 




CD1138' 


CDS2775 
Ortholog 
Oryza 
sativa 


AF332876 


AAK01713.1 


38 + 37 


CD09948 










CD1031J 


CDS1677 
Homolog 
Arabldopsls 


AL132966 REGION: 
116202..116729 


CAB67667 


38 + 39 


CD08462 






CD 




CD 


CDS3337 
Homolog 
Sugarcane 


CA279020 




40 


CD 






CD 




CD 


CDS2416 
Homolog 
Arabldopsls 


AF254447 


At3g57670 


41+42 


CD 






CD 




CD 


CDS2377 
Homolog 
Arabldopsls 


AJ311810 


CAC86167 


43 + 44 


CD 






CD 




CD 


CDS 

Homolog 

Arabldopsls 


AL355775 REGION: 
compfement(7957..8451 ) 


CAB90935 


45 + 46 


CD 






CD 




CD 


CDS 

Homolog 

Arabldopsls 


AL391143 REGION: 
complemsnt(31730..32938) 


CAC01747 


47 + 48 


CD 






CD 




CD 


CDS3641 
Homolog 
Arabldopsls 


X98676 


CAA67236 


49 + 50 


CD 






CD 




CD 
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Table 10: examples promoters used in combination with 2xC2H2 for the methods of the 
present invention. 



Promoter 


Preferred expression type 


Origin species 


Gene 


PRO0151 


Seeds (mainly embryo and 
aleurone). Strong expression. 


Oryza sativa 


WSI18 


PRO0110 


Root 


Oryza sativa 


RCC3 




levels 


Saecharum officinarum 

wGvvl IOl Ul I ■ vl llwll icii ui ■ ■ 


Pro 

rip 


=RO0123 


Green tissue. Strong expression 
evels. 


Oryza sativa 


Protochlorophyllide 
reductase 


PRO0090 


Seed specific (mainly endosperm) 


Oryza sativa 


Prolamin RP6 


PRO0170 


Constitutive. Strong Expression. 


Oryza sativa 


High Mobility 
Group protein 


PRO0218 


Seeds (mainly embryo and 
aleurone) 


Oryza sativa 


oleosine 18kda 


PRO0061_2 


Young expanding tissues 


Oryza sativa 


beta-expansine 
EXPB9 


PRO0129 


Constitutive- High expression levels. 


Oryza sativa 


GOS2 



Example 8: use of the invention in corn. 

The methods of the invention described herein are also used in maize. To this aim, an STZ 
encoding gene, for example a maize or other STZ ortholog, is cloned under control of a 
promoter operable in maize, in a plant transformation vector suitable for Agrobacterium- 
10 mediated corn transformation. Methods to use for com transformation have been described in 
literature (Ishida et ah, Nat Biotechnol. 1996 Jun;14(6):745-50; Frame et al., Plant Physiol. 
2002 May;129(1):13-22). 

Transgenic plants made by these methods are grown in the greenhouse for T1 seed 
15 production, lnheritability and copy number of the transgene are checked by quantitative real- 
time PCR and Southern blot analysis and expression levels of the transgene are determined 
by reverse PCR and Northern analysis. Transgenic lines with single copy insertions of the 
transgene and with varying levels of transgene expression are selected for T2 seed 
production. 

20 

Progeny seeds are germinated and grown in the greenhouse in conditions well adapted for 
maize (16:8 photoperiod, 26-28°C daytime temperature and 22-24°C nighttime temperature) 
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as well under water-deficient, nitrogen-deficient, and excess NaCI conditions. Null segregants 
from the same parental line, as well as wild type plants of the same cultivar are used as 
controls. The progeny plants resulting from the selfing or the crosses are evaluated on different 
biomass and developmental parameters, including, plant height, stalkfetem thickness, stem 
5 size, number of leaves, total above ground area, leaf greenness, time to maturity, time to 
silking, flowering time, time to flower, ear number, ear length, row number, kernel number, 
kernel size , kernel oil content, grain maturity, harvesting time. The seeds of these lines are 
also checked on various parameters, such as grain size, total grain yield per plant, and grain 
quality (starch content, protein content and oil content). 

10 

Lines that are most significantly improved compared to corresponding control lines are 
selected for further field-testing and marker-assisted breeding, with the objective of transferring 
the field-validated transgenic traits into commercial germplasm. The testing of maize for 
growth and yield-related parameters in the field is conducted using well-established protocols. 
15 The corn plants are particularly evaluated on yield parameters, such as for example, amount of 
plants per acre, amount of ears per plant, amount of rows per ear, amount of seeds per row 
and TKW. Subsequent improvements for introgressing specific loci (such as transgene 
containing loci) from one germplasm into another is also conducted using well-established 
protocols. 

20 
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Claims 

1. Method for increasing plant yield relative to corresponding wild type plants, comprising 
modifying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 zinc finger 

5 protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger protein. 

2. Method for increasing leaf surface area relative to corresponding wild type plants, 
comprising modifying expression in a plant of a nucleic acid sequence encoding a 2xC2H2 
zinc finger protein and/or modifying in a plant level and/or activity of a 2xC2H2 zinc finger 

10 protein. 

a Method for prolonging vegetative growth phase of a plant relative to corresponding wild 
type plants, comprising modifying expression in a plant of a nucleic acid sequence 
encoding a 2xC2H2 zinc finger protein and/or modifying in a plant level and/or activity of a 
15 2xC2H2 zinc finger protein. 

4. Method according to any of claims 1 to 3, wherein said modifying expression, level and/or 
activity is effected by recombinant means and/or chemical means. 

20 5. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a QALGGH motif. 

6. Method according to any of claims 1 to 4, wherein said 2xC2H2 zinc finger protein 
comprises a NNM(W)QMH motif. 

25 

7. Method according to any of claims 1 to 6, wherein said 2xC2H2 zinc finger protein 
comprises an EAR motif. 

8. Method according to any of claims 1 to 7, wherein said 2xC2H2 zinc finger protein further 
30 comprises a B-box. 

9. Method according to any of claims 1 to 8, wherein said 2xC2H2 zinc finger protein further 
comprises an L-box. 

35 10. Method according to any of claims 1 to 9, wherein said 2xC2H2 zinc finger protein is 
derived from a dicotyledonous plant, preferably from the family Brassicaceae, further 
preferably from Arabidopsis thaliana, more preferably the nucleic add is as represented by 
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SEQ ID NO 2 or a homologue, derivative or active fragment thereof and/or wherein said 
nucleic acid is as represented by SED ID NO 1 or a portion thereof or sequences capable 
of hybridising therewith . 

5 11. Method according to claim 10, wherein said homologue, derivative or active fragment has, 
in increasing order of preference, at least 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 
38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 52%, 54%, 
56%, 58%, 60%, 62%, 64%, 66%, 68%, 70%, 72%, 74%, 76%, 78%, 80%, 82%, 84%, 
86%, 88%, 90%, 92%, 94%, 96%, 98% sequence identity with the sequence of SEQ ID NO 

10 2. 

12. Method according to any of claims 1 to 1 1 , wherein said plant Is a monocot 

13. Method according to any of claims 1 to 12, wherein said modifying expression is effected 
15 by introducing into a plant a nucleic acid capable of modifying expression of a gene 

encoding a 2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein. 

14. Method according to claim 13, wherein said nucleic acid capable of modifying expression is 
20 a nucleic acid encoding a 2xC2H2 protein, such as a 2xC2H2 protein as defined in any of 

claims 5 to 11. 

15. Method according to claims 13 or 14, wherein said nucleic acid introduced into a plant is an 
alternative splice variant of a nucleic acid as defined in claim 14. 

25 

16. Method according to claims 13 or 15, wherein said nucleic acid introduced into a plant is an 
allelic variant of a nucleic acid as defined in claim 14. 

17. Method according to claims 13 or 16, wherein said nucleic acid introduced into a plant is 
30 comprised on at least part of a chromosome. 

18. Method according to any of claims 1 to 17, wherein said modifying expression comprises 
increased expression. 

35 19. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a constitutive promoter, such as a GOS2 promoter. 
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20. Method according to any of claims 1 to 18, wherein expression of said nucleic acid is 
driven by a plant promoter, preferably a tissue preferred promoter, such as seed-preferred 
promoter. 

5 21. Method according to any of claims 1 to 20, wherein said increased yield comprises 
increased above ground bfomass. 

22. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased seed yield. 

10 

23. Method according to any of claim 1 to 20, wherein said increased yield comprises 
increased root yield. 

24. Construct comprising: 

15 (i) A nucleic acid capable of modifying expression of a nucleic acid encoding a 

2xC2H2 zinc finger protein and/or capable of modifying level and/or activity of a 
2xC2H2 zinc finger protein; 
(ii) One or more plant control sequence capable of driving expression of the 
nucleic acid sequence of (i); and optionally 

20 (iii) A transcription termination sequence. 

25. Construct according to claim 24, wherein said nucleic acid of (i) is a nucleic acid as defined 
in any of claims 1 4 to 17. 

25 26. Construct according to claim 24 or 25, wherein said control sequences of (ii) is at least a 
constitutive promoter, such as a GOS2 promoter. 

27. Construct according to claim 24 or 25, wherein said control sequences of (ii) is at least a 
tissue preferred promoter, such as seed-preferred promoter. 

30 

28. Host cell comprising a construct according to any of claims 24 to 27. 

29. Method for the production of a transgenic plant having increased yield, increased leaf 
surface area and/or prolonged vegetative growth, which method comprises 

35 (i) introducing into a plant or plant cell a 2xC2H2 zinc finger nucleic acid; 

(ii) Cultivating the plant or plant cell under conditions promoting plant 
growth. 
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30. Plant obtainable by a method according to any of claims 1 to 23 and 29, which plant has 
increased yield, modified leaf surface area and/or prolonged vegetative growth, relative to 
corresponding wild type plants. 

5 

31. Transgenic plant having increased yield, increased leaf surface area and/or prolonged 
vegetative growth, which transgenic plant has modified expression of a nucleic acid 
encoding a 2xC2H2 zinc finger protein and/or modified level and/or activity of a 2xC2H2 
zinc finger protein, relative to corresponding wild type plants. 

10 

32. Plant part, preferably a haivestable part, a propagule or progeny of a plant as defined in 
claim 30 or 31, which progeny has modified expression of a nucleic acid encoding 2xC2H2 
zinc finger protein and/or modified level and/or activity of a 2xC2H2 zinc finger protein, 
relative to corresponding wild type plants. 

15 

33. Plant or plant part according to any of claims 30 to 32, which plant is a monocotyledonous 
plant, preferably a cereal. 

34. Plant or plant part according to any of claims 30 to 33 selected from rice, maize, wheat, 
20 barley, millet, oats, rye, sorghum, soybean, sunflower, canola, sugarcane, alfalfa, 

leguminosae (bean, pea), flax , lupinus, rapeseed, tobacco, tomato, potato, squash, 
papaya, poplar and cotton. 

35. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
25 construct as defined in any of claims 24 to 27 to increase plant yield. 

36. A yield regulating composition comprising a nucleic acid encoding a 2xC2H2 protein, and/ 
or comprising a 2xC2H2 protein, and/or comprising a construct as defined in any one of 
claims 24 to 27. 

30 

37. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 to increase leaf surface area. 

38. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
35 construct as defined in any of claims 24 to 27 to prolong vegetative growth. 
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39. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
construct as defined in any of claims 24 to 27 as target for an agrochemical. 

40. Use of a nucleic acid encoding a 2xC2H2 protein, of a 2xC2H2 protein and/or of a 
5 construct as defined in any of claims 24 to 27 in a breeding program. 

41. Use of a plant as defined in any of claims 30 to 34 to produce enzymes, pharmaceuticals 
or agrochemicais. 

10 42. Use of a plant as defined in any one of claims 30 to 34 to produce food or feed products. 



15 
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SEQ ID NO 1: Arabidopsis thaliana STZ cDNA (CDS1536) 

AATGGCGCTCGAGGCTCTTACATCACCAAGATTAGCTTCTCCGATTCCTCCTTTGTTCGAAG 
ATTCTTCAGTCTTCCATGGAGTCGAGCACTGGACAAAGGGTAAGCGATCTAAGAGATCAAGA 
TCCGATTTCCACCACCAAAACCTCACTGAGGAAGAGTATCTAGCTTTTTGCCTCATGCTTCT 
CGCTCGCGACAACCGTCAGCCTCCTCCTCCTCCGGCGGTGGAGAAGTTGAGCTACAAGTGTA 
GCGTCTGCGACAAGACGTTCTCTTCTTACCAAGCTCTCGGTGGTCACAAGGCAAGCCACCGT 
AAGAACTTATCACAGACTCTCTCCGGCGGAGGAGATGATCATTCAACCTCGTCGGCGACAAC 
CACATCCGCCGTGACTACTGGAAGTGGGAAATCACACGTTTGCACCATCTGTAACAAGTCTT 
TTCCTTCCGGTCAAGCTCTCGGCGGACACAAGCGGTGCCACTACGAAGGAAACAACAACATC 
AACACTAGTAGCGTGTCCAACTCCGAAGGTGCGGGGTCCACTAGCCACGTTAGCAGTAGCCA 
CCGTGGGTTTGACCTCAACATCCCTCCGATCCCTGAATTCTCGATGGTCAACGGAGACGACG 
AAGTCATGAGCCCTATGCCGGCGAAGAAGCCTCGGTTTGACTTTCCGGTCAAACTTCAACTT 

TAAGGAAATT 

SEQ ID NO 2: Arabxdopsxs thaliana STZ protein with annotation 
of the domains 

B-BOX L-BOX 

MAIjEALTSPRIASPIPPLF EDSSVFHGVE 

ardnrqpppppav eklsyk!csvcdktfssyqai*gghka^ 



ARDNRQPPPPPAV ElQ^j^^^^^^^TTO^^^^^^^^Sg RKNLSQTLSGGGDDHSTSSATT 
TSAVTTGSGKSHV ^I^^rog^^^^^^^ yEGNNNINTSSVSNSEGAGSTSHVSSSH 
RGFDI^PPIPEFSMVNGDDEVMSPMPAKKPRFDFPVKLQL 



EAR motif 



SEQ ID NO 3: PRM3204 (sense, start codon in italics) 

5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCACAATGGCGCTCGAGGCTC 3' 

SEQ ID NO 4: PRM3205 (reverse, complementary stop codon in 
italics) 

5' GGGGACC ACT T T GT AC AAGAAAGC TGGGT AATT TCC TTAAAGT TGAAGTTT GA 3' 



SEQ ID NO 5: QALGGH motif 
QALGGH 

SEQ ID NO 6: NNM motif 
NN (M/W) QMH 

SEQ ID NO 7: EAR motif 
hDLNh (X) P 

SEQ ID NO 8: B-Box 
KR (S) KRXR 

SEQ ID NO 9: L-Box 
EXEXXAXCLXXL 
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ORTHOLOGS OF STZ in OTHER PLANT SPECIES 

SEQ ID NO 10: Dg_AF119050_l Datisca glomerata zinc-finger 
protein 1 (zfpl) mRNA, complete cds 

GGCACGAGGACAAATTCTCTCTCTATCCTCTGAATATCTTTGGTTTGTGAACTGAGAAGCTA 
TTAGATGGCTCTAGAAGCGCTCAACTCTCCGACCACAGCTACGCCGGTGTTTCACTACGACG 
ACCCCAGCTTGAATTACCTTGAGCCATGGACCAAGCGTAAGCGTTCCAAGCGTACGCGCTTA 
GATAGCCCCATACCGAGGAAGAGTACCTTGCTTTCTGCCTCATCATGCTCGCTCGTGGCCGC 
GTTGCCTCTGCAAATCGACGGGATTCTCAGTCTTCCATTCAGATTCAGCCTGAAGCAACGAC 
TTCGGCTACCAAAGTCAGTTATAAGTGCTCTGTGTGCGATAAGGCCTTTTCGTCTTATCAGG 
CTTTGGGTGGGCACAAGGCCAGCCACAGAAAGCTCGCTGGCGGCGAAGATCAATCGACTTCC 
TTTGCCACCACGAATTCAGCCACCGTCACTACCACCACAGCCTCCGGAGGTGGTGGCAGGTC 
TCATGAGTGTTCTATTTGCCACAAATCGTTCCCGACTGGCCAGGCCTTGGGTGGTCACAAGC 
GCTGCC AC TACGAAGGC AGTATCGGCGGC AAT AGT ATTC ACC AC CACAACAAT ACC ACC AAC 
AGCGGAAGCAACGGT GGC ATGAGCATG ACC TCCGAAGT AGGT TCCACACAC AC AGTC AGCC A 
CAGTCACCGTGACTTCGATCTCAACATCCCGGCCTTGCCGGAGTTTCGGTCGAATTTCTTCA 
TATCCGGGGATGACGAGGTCGAGAGTCCTCATCCGGCCAAGA2VACCCCGTATATTGATGAAA 
IPAAAAC AT TTC TC AAGATC ACTGAACC AGGCT TTAGTT T CTT TATAGG AGGAGAT TT AAAAA 
AGTAGTATCTCTCTTTCTTTATCCGTAGGATAATTAATATATTTCGTGTACATAAATTTGTA 
GTTCTTTAACACACTCTGTTTCATTTTCTTGCTTTGCTCAACTTTGTATTGGTTATTTCATT 

ATGAAAATTCAATT 

SEQ ID NO 11: Dg_AF119050_l Datisca glomerata, STZ ortholog, 
protein 

MALELALNSPTTATPVFHYDDPSIiNYLEPWTKRKRS 

ASANRRDSQSSIQIQPBATTSATKVSYKCSVCDKAFSSYQALGGHKASHRKLAGGEDQSTSF 
ATTNSATVTTTTASGGGGRSHECSICHKSFPTGQALGGHKRCHYEGSIGGNSIHHHNNTTNS 
GSNGGMSMTSEVGSTHWSHSHRDFDLNIPALPEFRSNFFISGDDEVESPHPAKKPRILMK 

SEQ ID NO 12:GmJT09602_U687 63.1J3MU68763 Glycine max (soybean) 

probable zinc finger protein SCOF-1 mRNA, complete cds 

AAAATT CTCAC TCTCTC TC TC ATCT CGAGATC AT AGT ATC AT AT TC AAT AT C ATT TC AT ACC 

AAACACATGGCTTTGGAAGCTCTCAACTCACCAACAACAACCGCTCCATCTTTTCCCTTTGA 

CGACCCAACTATTCCATGGGCGAAACGAAAACGTTCAAAGCGTTCTCGCGACCATCCTTCTG 

AAGAAGAGTACCTCGCCCTCTGCCTCATCATGCTCGCTCGCGGCGGCACCACCACCGTCAAC 

AACCGCCACGTCAGCCCTCCGCCGCTACAGCCACAGCCACAGCCGACACCAGATCCTTCCAC 

CAAGCTCAGTTACAAATGCTCCGTTTGCGACAAGAGCTTCCCCTCTTACCAAGCGCTCGGTG 

GACACAAGGCCAGTCACCGGAAACTCGCCGGCGCCGCCGAAGACCAACCCCCCAGCACCACC 

ACTTCCTCCGCCGCCGCCACCAGCTCCGCCTCCGGAGGTAAGGCCCATGAGTGCTCCATTTG 

CCACAAATCCTTCCCCACCGGACAGGCCCTTGGCGGACACAAACGTTGTCACTACGAAGGTA 

ACGGTAACGGAAATAACAACAACAGTAACAGCGTTGTCACCGTCGCCTCGGAAGGCGTGGGC 

TCCACCCACACTGTCAGTCACGGCCACCACCGCGACTTCGATCTCAACATCCCGGCCTTTCC 

GGATTTTTCGACCAAGGTCGGAGAAGACGAGGTTGAGAGCCCTCACCCTGTCATGAAGAAGC 

CTCGCCTCTTCGTCATTCCCAAGATCGAAATCCCCCAATTTCAATGAACTCGTTGAATTTTT 

AGTTTATTTTTCGACTATATATTTTGGAGAATTTTGAGAGTTACTATAATTTGATTTTGTAC 

ATAGTACTTGGAAGTTTTGTTGGACCGTACCGGACCCAGTTCTCTGGTTGAGGTTGTACTTT 

CACAACAGTGGC^GATTTGCAATTCAATTCAATTTATTTGTTTATTTTAAAAAAAAAAAAAA 

AAAA 

FIGURE 3 (continued) 
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SEQ ID NO 13: Gm_T09602 Glycine max (soybean) probable zinc 
finger protein SCOF-1, STZ otholog, protein 

MM1EIALNSPTTTAPSFPFDDPTIPWAKRKRSKRSRDHPSEEEYI1ALCLIMLARGGTTTVNNR 
HVSPPPLQPQPQPTPDPSTKLSYKCSVCDKSFPSYQALGGHKASHRKLAGAAEDQPPSTTTS 
SAAATSSASGGKAHECSICHKSFPTGQALGGHKRCHYEGNGNGNNNNSNSVVWASEGVGST 
HTVS HGHHRDFDLN I PAFPDF STKVGEDE VES PHP VMKKPRLFVT PKI E I PQFQ 

SEQ ID NO 14: Ms_CAB77055_ Y18788 . 1_MSY18788 Medicago sativa 

putative TFIIIA (or kruppel) -like zinc finger protein mRNA 

AATTCGGCACGAGAAATAACCACTTCTCTCTCAAAACCTCCTTTTGCCTTTTGCTTCTACTT 

TCACTTGCGTAACGCTAACTAACTCTTCTCGAGTGTTCXTCTTTTCATCATATGGCTATGGA 

AGCACTTAACTCACCCACCACTGCTACTCCTTTCACACCCTTTGAGGAACCAAATCTGAGTT 

ATCTTGAAACACCGTGGACGAAAGGTAAACGATCAAAGCGTTCTCGCATGGATCAATCTTCA 

TGCACTGAAGAAGAGTATCTCGCTCTTTGTCTCATCATGCTTGCTCGCAGCGGTAACAACAA 

CGACAAAAAGTCTGATTCGGTGGCGACGCCGCTAACCACCGTTAAACTCAGTCACAAATGCT 

CAGTCTGCAACAAAGCTTTCTCATCTTATCAAGCCCTAGGTGGACACAAAGCCAGTCACCGG 

AAAGCTGTTATGTCCGCAACCACCGCTGAAGATCAGATCACCACCACTTCATCCGCCGTGAC 

TACCAGCTCTGCTTCCAACGGTAAGAACAAGACTCATGAGTGTTCCATCTGTCACAAATCCT 

TCCCTACTGGACAGGCTTTGGGAGGACACAAGCGTTGTCACTACGAAGGCAGCGTTGGTGCC 

GGTGCCGGTGCTGGAAGTAACGCTGTAACTGCCTCTGAAGGAGTTGGATTGTCACACAGCCA 

CCACCGTGATTTTGATCTTAACCTCCCGGCTTTTCCGGACTTTTCAAAGAAGTTTTTCGTGG 

ATGACGAGGTTTTTAGTCCTTTACCTGCTGCAAAGAAGCCCTGTCTTTTCAAGCTGGAAATT 

CCTTCTCATTACTGATCAATAATAGATCCAATTTTATTGTTATTATTATTAATAATTATTAT 

CGCTTAGGGCATAGTTATTTTCTTTTTTCTTTCAATTATTTCGGATCAATTTGTTCTGTACA 

TACAAATTGGGATTGGTTTTAGAATTTAGGACGGTTGTAGACAATGGAAATTCAATTCAATT 

ATTTAATTTTGTGT 

SEQ ID NO 15: Ms_CAB77055_Medicago sativa putative TFIIIA (or 
kruppel) -like zinc finger protein, STZ otholog, protein 
MAMEALNSPTTATPFTPFEEPNLSYLETPWTKGKRSKRSRMDQSSCTEEEYIiALCLIMIiARS 

GNNN DKKS DS VATPLT TVKL S HKC SVCNKAF S S YQALGGHKAS HRKAVMS AT TAEDQI T T TS 
SAVTTSSASNGKNKTHECSICHKSFPTGQALGGHKRCHYEGSVGAGAGAGSNAVTASEGVGL 
SHSHHRDFDLNLPAFPDFSKKFFVDDEVFSPLPAAKKPCLFKLEIPSHY 

SEQ ID NO 16: Nt_AAC06243_ AF053077 Nicotiana tabacum osmotic 

stress-induced zinc-finger protein (zfp) mRNA, complete cds 

TTTTCCCTCGAATTTGATAACTAAAGAGAATATTATGACTCTTGAAGCTTTGAAGTCACCTA 

CGGCGGCAACGCCGACTCTACCACCACGCTA5PSAAGATGATGATGAAATTCATAATTTGGAT 

TCTTGGGCTAAAGGAAAACGATCAAAACGGCCCCGTATTGATGCCCCACCGACTGAAGAAGA 

GTATTTAGCCCTCTGTCTCATCATGCTCGCTCGCAGCGGAACCGGAACCAGAACCGGTTTAA 

CTGATGCTACTACTTCCCAACAACCTGCCGATAAAAAAACCGCCGAGTTGCCGCCGGTTCAT 

AAGAAAGAGGTGGCAACAGAGCAAGCAGAGCAATCTTACAAGTGTAGCGTGTGTGACAAGGC 

TTTTTCTTCTTATCAAGCACTCGGTGGGCATAAAGCAAGTCACCGTAAAACTACTACTACTG 

CTACCGCCGCCTCTGATGATAACAATCCTTCAACTTCAACTTCCACTGGCGCCGTTAATATC 

TCTGCTCTTAATCCAACTGGTCGTTCACACGTCTGTTCTATTTGCCACAAGGCTTTTCCTAC 

TGGCCAAGCTTTGGGTGGGCACAAGCGCCGCCACTATGAAGGCAAACTCGGTGGTAACAGCC 

GCGACTTAGGCGGCGGCGGCGGCGGCGGTCATAGTGGAAGCGTCTTGACTACTTCAGACGGC 
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GGCGCGTCGACTCACACGCTACGTGACTTTGACCTGAACATGCCTGCTTCGCCGGAATTGCA 
ACTGGGTCTGAGTATTGATTGTGGACGGAAAAGTCAACTGTTGCCGATGGTCCAAGAGGTGG 
AAAGTCCTATGCCTGCAAAGAAACCGCGTTTATTGTTTTCGTTGGGTTCaAACTTCTTTAGG 
GGAATTGAATTGATTGTGTTTTAGCCAAATTAGTAAATTGGTTCATGTGATTTTATTTTTAG 
GAAAAGGAATTATTGATTGTTTTACCCGTTTATTCTTAGGGTGGTATTATGTACAGGGAGTG 
AATCATTCATTGGTTTTACACTTTCTTAATTATATATTCTTTTTTTTTACACATAAAAAAAA 

AAAAAAA 

SEQ ID NO 17: Nt_AAC06243_Nicotiana tabacum osmotic stress- 
induced zinc-finger protein, STZ otholog, protein 
MTLEALKSPTAATPTLPPRYEDDDEIHNLDSWAKGKRSKRPRIDAPPTEEEY1ALCLIMLAR 

SGTGTRTGLTDATTSQQPADKKTAELPPVHKKEVATEQAEQSYKCSVCDKAFSSYQALGGHK 
ASHRKTTTTATAASDDNNPSTSTSTGAVNISALNPTGRSHVCSICHKAFPTGQALGGHKRRH 
YEGKLGGNSRDLGGGGGGGHSGSVLTTSDGGASTHTLRDFDLNMPASPELQLGLSIDCGRKS 

QLLPMVQEVESPMPAKKPRLLFSLG 

SEQ ID NO 18: Os_AF332876 Oryza sativa zinc finger 
transcription factor ZF1 mRNA, complete cds 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 

AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 

AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 

AGCAGCACGAGGTGGAGGAGGCGACGGTCGTGACGAGCAGCAGCGCCACGAGCGGGGAGGAG 

GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 

GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 

CGCCTCCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 

GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 

GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 

CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCATGACCAACAGA 

GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 

GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 

CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 

TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 

GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 

CCGCGEAATTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 

CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 

TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 

CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 

GTCATATATTGTAGACCAAAAAAAAAAAAAAAAAA 

SEQ ID NO 19: Os_AF332876 Oryza sativa seedling zinc finger 
transcription factor ZF1, STZ otholog, protein 

MSSASSMEALHAAVLKEEQQQHEVEEATVWSSSATSGEEGGHLPQGWAKFKRSRRQRSEEE 
NLALCLLMLARGGHHRVQAPPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLP 
TPPAAPVLAPAPVAALLPSAEDREPATSSTAASSDGMTNRVHRCSICQKEFPTGQALGGHKR 
KHYDGGVGAGAGASSTELLATVAAESEVGSSGNGQSATRAFDLNLPAVPEFVWRPCSKGKKM 

WDEEEEVQSPLAFKKPRLLTA 
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SEQ ID NO 20: Ph_BAA05079JD26086.1 [Petunia x hybrida]. 
PETZFP4 zinc-finger protein gene • 

TTCACTCACCAAAACAACTTCTCTACCTCTTCTACTTGCACATTCAAATTCTTTCATTACTA 
CT TATCTC T ACTAATCT TGAT TCGATT TT AGT AAAT CAAACAAGAG AAT CT TT TC AGTAAT A 
CAAACAAG AAAAT TT TC TCTCTATACT TGATT GAGT TT AGT AAGGC AAACAAGAAAACT AT C 
ATGGCACT TGAAGCATT GAAT TCTCCAACT ACAACAACACCACC ATCAT TCCAAT TTGAGAA 
CAACGGGC TT AAGT ACC TTGAGAGT TGGAC AAAAGGTAAAAG ATCAAAAAGGC AACGCAGC A 
TGGAACGACAGTGT ACTGAAGAAGAGT AT T TAGCACTT TGTC TT ATCATGC TAGC ACGT AGC 
GATGGTTCTGTTAATAACTCACGGTCTCTACCACCACCACCACTACCACCATCAGTTCCAGT 
AACGTCGCAAATAAACGCGACGTTATTGGAACAGAAGAATTTGTACAAGTGTTCCGTTTGTG 
GTAAAGGGTTTGGGTCTTATCAAGCTTTAGGTGGACATAAAGCAAGTCACCGGAAACTTGTC 
AGC ATGGGAGG AGATGAAC AATCT ACT AC T TCCACT ACTACT AACGTAACGGG AACT AGTTC 
CGCTAACGTTAACGGTAACGGAAGAACTCACGAATGTTCAATTTGTCACAAGTGCTTTCCTA 
CTGGACAAGCTTTAGGTGGTCATAAAAGGTGCCACTATGACGGTGGTAACGGTAACGGTAAC 
GGAAGTGTAAGTGTTGGGGTGACGTCATCTGAAGGTGTGGGGTCCACTATTAGTCATCACCG 
TGACTTTGACTTGAATATTCCCGCGTTGCCGGAGTTTTGGCCGGGATTTGGTTCCGGCGAGG 
ATGAGGTGGAGAGTCCTCATCCAGCAAAGAAGTCAAGGCTATCTCTTCCACCTAAACTTGAA 
TT AT TC AAAGGATTATAGAGGGAATAT TG ATT TGTTAC AGGAAG ATTT ATT AG GATT CACGA 
AT TT TT TGT TGACT AGT TT AT GT AATAT 

SEQ ID NO 21: Ph_BAA05079 [Petunia x hybrida] zinc-finger 
protein, STZ otholog, protein 

MALEALNSPTTTTPPSFQFENNGLKYLESWTKGKRSKRQRSMERQCTEEEYLALCLIMLARS 
DGSVNNSRSLPPPPLPPSVPVTSQINATLLEQKNLYKCSVCGKGFGSYQALGGHKASHRKLV 
SMGGDEQSTTSTTTNVTGTSSANVNGNGRTHECSICHKCFPTGQALGGHKRCHYDGGNGNGN 
GSVSVGVTSSEGVGSTISHHRDFDLNIPALPEFWPGFGSGEDEVESPHPAKKSRLSLPPKLE 

LFKGL 

SEQ ID NO 22: Ta_BAA03901 Triticum aestivum gene for zinc- 
finger protein WZF1, complete cds 

ATGTCGTCGTCGGCCATGGAAGCGCTCCACGCCCTGATCCCGGAGCAGCACCAGCTGGACGT 
TGAGGCGGCTGCGGCTGTCAGCAGCGCCACCAGCGGCGAGGAGAGCGGCCACGTGCTGCAGG 
GGTGGGCCAAGAGGAAGCGATCGCGCCGCCAGCGCTCCGAGGAGGAGAACCTCGCGCTCTGC 
CTCCTCATGCTCTCGCGCGGCGGCAAGCAGCGTGTTCAGGCGCCGCAGCCGGAGTCGTTCGC 
TGCGCCGGTGCCTGCCGAGTTCAAGTGCTCCGTCTGCGGCAAGTCCTTCAGCTCCTACCAGG 
CGCTCGGAGGCCACAAGACGAGCCACCGGGTGAAGCAGCCGTCTCCTCCCTCTGATGCCGCT 
GCTGCCCCACTCGTGGCCCTCCCGGCCGTCGCCGCCATCCTGCCGTCCGCCGAGCCGGCCAC 
GTCGTCCACCGCCGCGTCCTCCGACGGCGCGACCAACAGAGTCCACAGGTGCTCCATCTGCC 
AAAAGGAGTTCCCGACTGGGCAGGCGCTCGGCGGGCACAAGAGGAAGCACTACGACGGAGGC 
GTGGGCGCCGCCGCCTCGTCGACCGAGCTTCTGGCCGCCGCGGCCGCCGAGTCTGAGGTGGG 
GAGCACCGGCAACGGGAGCTCCGCCGCCCGGGCCTTCGACCTGAACATTCCGGCCGTGCCGG 
AGTTCGTGTGGAGGCCGTGCGCCAAGGGCAAGATGATGTGGGAGGACGATGAGGAGGTGCAG 
AGCCCCCTCGCCTTCAAGAAGCCTCGGCTTCTCACCGCTTGA 
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SEQ ID NO 23: Ta_BAA03901_WZFl Triticum aestivum, STZ otholog, 
protein 

^SSAMElALHALIPEQHQLDVE2VAAAVSSATSGEESGHVLQGWAKRKRSRRQRSEEENLALC 
LLMLSRGGKQRVQAPQPESFAAPVPAEFKCSVCGKSFSSYQALGGHKTSHRVKQPSPPSDAA 
AAPLVALPAVAAILPSAEPATSSTAASSDGATNRVHRCSICQKEFPTGQALGGHKRKHYDGG 
VGAAAS STELLAAAAAESEVGSTGNGS S AARAFDLNI P AVPE FVWRPCAKGKMMWE DDEEVQ 
SPLAFKKPRLLTA 

SEQ ID NO 24: Ca AF539746 Capsicum annum zinc finger protein 

mRNA, complete cds , 

AAAATCTTCGCTACTTACTTACATCTTCTAGAATAGTCACTAGAACCAGTAACTTTATACAA 

m* nw* m jTOnrifTTRiaRrTTTGA ATTCTCCAACTGGTACACCAACTCCGCCACCG 



SEQ ID NO 25: Ca AF539746-capsicum annum, STZ otholog, protein 
MALEALNSPTGTPTPPPFQFESDGQQLRYIENWRKGKRSKRSRSMEHQPTEEEYLALCLIML 
ARSGGSVNHQRSLPPPAPVMKLHAPSSSSAAEEEKEKMVYKCSVCGKGFGSYQALGGHKASH 
RKLVPGGDDQSTTSTTTNATGTTTSVNGNGNRSGRTHECSICHKCFPTGQALGGHKRCHYDG 
GIGNGNANSGVSASVGWSSEGVGSTVSHRDFDLNIPALPEFWLGFGSGEDEVESPHPAKKS 

KLCLPPKYELFQH 

PARALOGS OF STZ IN ARABIDOPSIS THALIANA 

SEQ ID NO 26: gi JL8402298_ref_NM_112848 . 1 mRNA 

ACTTCACTCTCTAATTTCCTTCTCTCTATCTCTCACCATATTCGCGATTAAAAACTCTCAAC 
TTTTCTCTCAAA.TTTCTGATCCTTTGATCCAACAGTTAGAAGAAGATTCATCTGATCATGGC 
CCTCGAA.GCGATGAACACTCCAACTTCTTCTTTCACCAGAATCGAAACGAAAGAAGATTTGA 
TGAACGACGCCGTTTTCATTGAGCCGTGGCTTAAACGCAAACGCTCCAAACGTCAGCGTTCT 
CACAGCCCTTCTTCGTCTTCTTCCTCACCGCCTCGATCTCGACCCAAATCCCAGAATCAAGA 
TCTTACGGAAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTAAAGATCAACCGTCGC 
AAACGCGATTTCATCAACAGTCGCAATCGTTAACGCCGCCGCCAGAATCAAAGAACCTTCCG 
TACAAGTGTAACGTCTGTGAAAAAGCGTTTCCTTCCTATCAGGCTTTAGGCGGTCACAAAGC 
AAGTCACCGAATCAAACCACCAACCGTAATCTCAACAACCGCCGATGATTCAACAGCTCCGA 
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CCATCTCCATCGTCGCCGGAGAAAAACATCCGATTGCTGCCTCCGGAAAGATCCACGAGTGT 
TCAATCTGTCATAAAGTGTTTCCGACGGGTCAAGCTTTAGGCGGTCACAAACGTTGTCACTA 
CGAAGGCAACCTCGGCGGCGGAGGAGGAGGAGGAAGCAAATCAATCAGTCACAGTGGAAGCG 
TGTCGAGCACGGTATCGGAAGAAAGGAGCCACCGTGGATTCATCGATCTAAACCTACCGGCG 
TTACCTGAACTCAGCCTTCATCACAATCCAATCGTCGACGAAGAGATCTTGAGTCCGTTGAC 
CGGTAAAAAACCGCTTTTGTTGACCGATCACGACCAAGTCATCAAGAAAGAAGATTTATCTT 
TAAAAATCTAATACTCGACTATTAATTCTTGTGTGATTTTTTTCGTTACAACCATAGTTTCA 
TTTTCATTTTTTTAGTTACAAATTTTTAATTGTTCTGATTTGGATTGAATATTGGTATATTG 

TTAGGGGTTGATAC 

SEQ ID NO 27: Translation of gi_18402298_ref_NM_112848 . 1_ 
MAIiEAMNTPTSSFTRIETKEDLMNDAVFIEPWLKRKRSKRQRSHSPSSSSSSPPRSRPKSQN 
QDLTEEEYLALCLLMLAKDQPSQTRFHQQSQSLTPPPESKNLPYKCNVCEKAFPSYQALGGH 
KASHRIKPPTVI STTADDSTAPTIS IVAGEKHPIAASGKI HECS ICHKVFPTGQALGGHKRC 
HYEGNLGGGGGGGSKSISHSGSVSSTVSEERSHRGFIDLNLPALPELSLHHNPIVDEEILSP 

LTGKKPLLLTDHDQVIKKEDLSLKI 

SEQ ID MO 28: gi 30680473_ref_NM_120516.3_mRNA 

AAATCAAATCTTTTCATTTACAATTATCTTTCTTCTCAATTTAGAACTTAGTAGCTAGTCTT 
CAAGATAATGGCACTTGAAACTCTTACTTCTCCAAGATTATCTTCTCCGATGCCGACTCTGT 
TTCAAGATTCAGCACTAGGGTTTCATGGAAGCAAAGGCAAACGATCTAAGCGATCAAGATCT 
GAATTCGACCGTCAGAGTCTCACGGAGGATGAATATATCGCTTTATGTCTCATGCTTCTTGC 
TCGCGACGGAGATAGAAACCGTGACCTTGACCTGCCTTCTTCTTCGTCTTCACCTCCTCTGC 
TTCCTCCTCTTCCTACTCCGATCTACAAGTGTAGCGTCTGTGACAAGGCGTTTTCGTCTTAC 
CAGGCTCTTGGTGGACACAAGGCAAGTCACCGGAAAAGCTTTTCGCTTACTCAATCTGCCGG 
AGGAGATGAGCTGTCGACATCGTCGGCGATAACCACGTCTGGTATATCCGGTGGCGGGGGAG 
GAAGTGTGAAGTCGCACGTTTGCTCTATCTGTCATAAATCGTTCGCCACCGGTCAAGCTCTC 
GGCGGCCACAAACGGTGCCACTACGAAGGAAAGAACGGAGGCGGTGTGAGTAGTAGCGTGTC 
GAATTCTGAAGATGTGGGGTCTACAAGCCACGTCAGCAGTGGCCACCGTGGGTTTGACCTCA 
ACATACCGCCGATACCGGAATTCTCGATGGTCAACGGAGACGAAGAGGTGATGAGTCCTATG 
CCGGCGAAGAAACTCCGGTTTGACTTCCCGGAGAAACCCTAAACATAAACCTAGGAAAAACT 
TTACAGAATTCATTTTATAGGAAATTGTTTTACTGTATATACAAATATCGATTTTGATTGAT 
GTTCTTCTTCACTGAAAAATTATGATTCTTTGTTGTATAATTGATGTTTCTGAAAAAGATAT 
AACTTTTTATTGTTTCACACGTATCAAAATTTGCTTGGATACATCA 

SEQ ID NO 29: Translation of gi_30680473_ref_NM_120516.3_ 

MALETLTSPRLSSPMPTLFQDSALGFHGSKGKRSKRSRSEFDRQSLTEDEYIALCLMLLARD 

GDRNRDLDLPSSSSSPPLLPPLPTPIYKCSVCDKAFSSYQALGGHKASHRKSFSLTQSAGGD 

ELSTSSAITTSGISGGGGGSVKSHVCSICHKSFATGQALGGHKRCHYEGKNGGGVSSSVSNS 

EDVGSTSHVSSGHRGFDLNIPPIPEFSMVNGDEEVMSPMPAKKLRFDFPEKP 
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SEQ ID MO 30: gi 30693252_ref_NM_114853.2_mRNA 

ATGGCTCTCGACACTCTCAATTCTCCCACCTCCACCACCACAACCACCGCTCCTCCTCCTTT 
CCTCCGTTGCCTCGACGAAACCGAGCCCGAAAACCTCGAATCATGGACCAAAAGAAAACGTA 
CAAAACGTCACCGTATAGATCAACCAAACCCTCCTCCTTCTGAAGAAGAGTATCTCGCTCTT 
TGCCTCCTTATGCTCGCTCGTGGCTCCTCCGATCATCACTCTCCACCGTCGGATCATCACTC 
TCTTTCTCCACTGTCCGATCATCAGAAAGATTACAAGTGTTCCGTCTGTGGCAAATCTTTCC 
CGTCTTACCAAGCGTTAGGTGGACACAAAACAAGTCACCGGAAACCGGTTAGTGTCGATGTT 
AATAATAGTAACGGAACCGTTACTAATAACGGAAATATTAGTAACGGTTTAGTTGGTCAAAG 
TGGGAAGACTCATAACTGCTCTATATGTTTTAAGTCGTTTCCCTCTGGTCAAGCATTGGGTG 
GTCACAAACGTTGTCACTATGATGGTGGTAACGGTAACAGTAACGGTGACAATAGCCACAAG 
TTTGACCTAAATTTACCGGCTGATCAAGTTAGTGATGAGACAATTGGAAAAAGTCAACTCTC 
CGGTGAAGAAACAAAGTCGGTGTTGTGATTATTATTATTTTTTACCGATCGGGATTAGCTAG 
TGGTTGATCATTAGCTGAGTCTGTAATGAAAATGAT 

SEQ ID NO 31: Translation of gi_30693252_ref_NM_114853.2 
MALDTLNSPTSTTTTTAPPPFLRCLDETEPENLESWTKRKRTKRHRIDQPNPPPSEEEYLAL 
CLtMLARGSSDHHSPPSDHHSLSPLSDHQKDYKCSVCGKSFPSYQALGGHKTSHRKPVSVDV 
NNSNGTVTNNGNISNGLVGQSGKTHNCSICFKSFPSGQALGGHKRCHYDGGNGNSNGDNSHK 

FDLNLPADQVSDETIGKSQLSGEETKSVL 

SEQ ID NO 32:. gi 30694224_ref_NM_123683 . 2_mRNA 

AAATTTTCTATAGCAATGGCGCTTGAAGCTCTTAATTCACCAAGATTGGTCGAGGATCCCTT 
AAGATTCAATGGCGTTGAGCAGTGGACCAAATGTAAGAAA.CGATCCAAACGTTCGAGATCTG 
ATCTTCATCATAACCACCGTCTCACTGAGGAAGAGTATCTAGCTTTCTGTCTCATGCTTCTT 
GCTCGGGATGGCGGCGATCTTGACTCTGTGACGGTTGCGGAGAAGCCGAGTTATAAGTGTGG 
CGTTTGTTACAAGACGTTTTCGTCTTACCAAGCTCTCGGCGGTCATAAAGCGAGCCACCGGA 
GCTTATACGGTGGTGGAGAGAATGATAAATCGACACCATCCACCGCCGTGAAATCTCACGTT 
TGTTCGGTTTGCGGGAAATCTTTCGCCACCGGTCAAGCTCTCGGCGGCCACAAGCGGTGCCA 
CTACGATGGTGGCGTTTCGAACTCGGAAGGTGTGGGGTCTACTAGCCACGTCAGCAGTAGTA 
GCCACCGTGGATTTGACCTTAATATTATACCGGTGCAGGGATTTTCGCCGGACGACGAAGTG 
ATGAGTCCGATGGCGACTAAGAAGCCTCGCCTGAAGTAAGTCTTTGTTGAAGACCTGGAAGT 
TTATCAAATGTAAATATCAAATTTCAATTTCAAGGAACAGTTTTGTTGATTCTATTACCAAT 

ACACAATACGATTCAATTCC 

SEQ ID NO 33: Translation of gi_30694224_ref_NM_123683 .2_ 
MALEALNSPRLVEDPLRFNGVEQWTKCKKRSKRSRSDLHHNHRLTEEEYLAFCLMLLARDGG 
DLDSVTVAEKPSYKCGVCYKTFSSYQALGGHKASHRSLYGGGENDKSTPSTAVKSHVCSVCG 
KSFATGQALGGHKRCHYDGGVSNSEGVGSTSHVSSSSHRGFDLNIIPVQGFSPDDEVMSPMA 

TKKPRLK 
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SEQ ID NO 34: gi_30698307_ref_NM_126145.2_mRNA 

CACACTTCACTCTTTCTTCATCTTCTTCTTCTTAAATAGCTCGAAATCACATCTCACAGAAT 

TAAATCTTATGGCTCTCGAGACTCTCAATTCTCCAACAGCTACCACCACCGCTCGGCCTCTT 

CTCCGGTATCGTGAAGAAATGGAGCCTGAGAATCTCGAGCAATGGGCTAAAAGAAAACGAAC 

AAAACGTCAACGTTTTGATCACGGTCATCAGAATCAAGAAACGAACAAGAACCTTCCTTCTG 

AAGAAGAGTATCTCGCTCTTTGTCTCCTCATGCTCGCTCGTGGCTCCGCCGTACAATCTCCT 

CCTCTTCCTCCTCTACCGTCACGTGCGTCACCGTCCGATCACCGAGATTACAAGTGTACGGT 

CTGTGGGAAGTCCTTTTCGTCATACCAAGCCTTAGGTGGACACAAGACGAGTCACCGGAAAC 

CGACGAACACTAGTATCACTTCCGGTAACCAAGAACTGTCTAATAACAGTCACAGTAACAGC 

GGTTCCGTTGTTATTAACGTTACCGTGAACACTGGTAACGGTGTTAGTCAAAGCGGAAAGAT 

TCACACTTGCTCAATCTGTTTCAAGTCGTTTGCGTCTGGTCAAGCCTTAGGTGGACACAAAC 

GGTGTCACTATGACGGTGGCAACAACGGTAACGGTAACGGAAGTAGCAGCAACAGCGTAGAA 

CTCGTCGCTGGTAGTGACGTCAGCGATGTTGATAATGAGAGATGGTCCGAAGAAAGTGCGAT 

CGGTGGCCACCGTGGATTTGACCTAAACTTACCGGCTGATCAAGTCTCAGTGACGACTTCTT 

AACGTTGACTGAGTTTGAGGAAAAAGTCAACTATCAAGCGAAGAAAGGGTTAGTGGACGGTG 

AAGATTAACGGTCGTTTCTTTCCAGTTGCTTCGGTTTGAGCTTGACTGGGTCTGTAATGAAA 

ATGATTGGAGTGGACTTGGCATTATTATTATTATTTTTAAAAAGAAATGTTAATTTGTTGTT 

GGATTTGTTTATAGATAGAGGAAACAATTGGGATACACAAATATTTTTTTTTTTTACAAAGA 

AAATAATAATGCAGAGATGGATGATTGGATCGTACACGTTATTATATAGTGGACCATTCTGT 

AATCGTGAATTATTATTATTTGTTAGAAATTTAATTTTCGT 

SEQ ID NO 35: Translation of gi_30698307_ref_NM_126145.2_ 
MALETLNSPTATTTARPLLRYKEEMEPENLEQWAKRKRTKRQRFDHGHQNQETNKNLPSEEE 
YLALCLIMLARGSAVQSPPLPPLPSRASPSDHRDYKGTVCGKSFSSYQALGGHKTSHRKPTN 
TSITSGNQELSNNSHSNSGSWINVTVNTGNGVSQSGKIHTCSICFKSFASGQALGGHKRCH 
YDGGNNGNGNGS S SNSVELVAGS DVSDVDNERWSEE SAI GGHRGFDLNLPADQVSVTT S 

OTHER GENES IN EVALUATION 

SEQ ID NO 36: gi_ 12698881_ref_ AF_332876.1 _mRNA, 2xC2H2, 
Oryza sativa 

AATTCGGCACGAGGCCACACAGCAACCAGCCAGCTGCCACACTAGCTTGAGGCGAGCGAGCG 

AAGCTTAGCTAGCGGATAGAACAAGTCGTCGATCTGCTTGCTGCTTTTGTGAATTGCGGTGG 

AAGCATGTCGAGCGCGTCGTCCATGGAAGCGCTCCACGCCGCGGTGCTCAAGGAGGAGCAGC 

AGCAGCACGAGGTGGAGGAGGCGACGGTCGTGACGAGCAGCAGCGCCACGAGCGGGGAGGAG 

GGCGGACACCTGCCCCAGGGGTGGGCGAAGCGGAAGCGGTCGCGCCGCCAGCGATCGGAGGA 

GGAGAACCTCGCGCTCTGCCTCCTCATGCTCGCCCGCGGCGGCCACCACCGCGTCCAGGCGC 

CGCCTCCGCTCTCGGCTTCGGCGCCCCCGCCGGCAGGTGCGGAGTTCAAGTGCTCCGTCTGC 

GGCAAGTCCTTCAGCTCCTACCAGGCGCTCGGCGGCCACAAGACGAGCCACCGGGTCAAGCT 

GCCGACTCCGCCCGCAGCTCCCGTCTTGGCTCCCGCCCCCGTCGCCGCCTTGCTGCCTTCCG 

CCGAGGACCGCGAGCCAGCCACGTCATCCACCGCCGCGTCCTCCGACGGCATGACCAACAGA 

GTCCACAGGTGTTCCATCTGCCAGAAGGAGTTCCCCACCGGGCAGGCGCTCGGCGGGCACAA 

GAGGAAGCACTACGACGGTGGCGTAGGCGCCGGCGCCGGCGCATCTTCAACCGAGCTCCTGG 

CCACGGTGGCCGCCGAGTCCGAGGTGGGAAGCTCCGGCAACGGCCAGTCCGCCACCCGGGCG 

TTCGACCTCAACCTCCCGGCCGTGCCGGAGTTCGTGTGGCGGCCGTGCTCCAAGGGCAAGAA 

GATGTGGGACGAGGAGGAGGAGGTCCAGAGCCCCCTCGCCTTCAAGAAGCCCCGGCTTCTCA 

CCGCGTAATTCAGCAGCTGCACGGATCCGATCCGTCAGAGTTTTTGTCTAGGGAGTGAAATT 

CAGTCGAAACACACTATTCGTTGATTCGTTTTGTGCCGCTATTGTTTAATTTGTTCCTGCTT 
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TTGTACAGAGCAAGCGAGTGATACATAGCCATACATACAGTCATACAGATATAGGTCTAGCT 
CTTCCTTGGTTCTTTGTAACACTGGAACTGTACCTGTATCTTTTACACTTTGTTCTTTGACA 

GTCATATATTGTAGACCAAAAAAAAAAAAAAAAAA 

SEQ ID NO 37: gi 12698882_ref_AAK01713.1, 2xC2H2, Oryza sativa 
MSSASSMEALHAAVLKEE5QQQHEVEEATVVTSSSATSGEEGGHLPQGWAKRKRSRRQRSEEE 
NLALCLLMLARGGHHRVQAPPPLSASAPPPAGAEFKCSVCGKSFSSYQALGGHKTSHRVKLP 

TP PAAPVL APAP VAALLPS AE DREP AT S S TAAS S DGMTNRVHRC S I CQKE FPTGQALGGHKR 
KHYDGGVGAGAGASSTELLATVAAESEVGSSGNGQSATRAFDLNLPAVPEFVWRPCSKGKKM 

WDEEEEVQSPLAFKKPRLLTA 

SEQ ID NO 38: gi_6434215_ref_AL132966. l_region 116202 ... 
116729, 2xC2H2, Arabidopsis thaliana 

ATGAAGAGAGACCGGTCCGATTACGAAGAATCCATGAAGCATATAGACATAGTAGAAAGTCT 
AATGATGTTATCTCGAAGTTTCGTGGTCAAACAAATCGATGTAAAGCAATCTACCGGAAGCA 
AAACGAACCATAATAACCACTTCGAATGCAAAACGTGTAACCGGAAATTTGATTCCTTCCAA 
GCTCTTGGAGGTCATAGAGCTAGCCACAAGAAACCTAAGCTGATCGTTGACCAAGAACAGGT 
GAAGCATCGTAACAAAGAGAATGATATGCATAAGTGTACAATTTGCGATCAAATGTTTGGGA 
CCGGTCAAGCTCTAGGCGGTCACATGAGAAAGCATAGGACGAGCATGATAACCGAGCAATCG 
ATTGTCCCTTCTGTGGTTTATTCCAGACCGGTTTTTAATCGTTGCAGTAGCAGCAAGGAGAT 
CTTGGACTTAAATCTAACTCCATTGGAAAATGATCTTGTGTTAATCTTTGGGAAGAATTTGG 

TTCCACAAATTGATTTGAAGTTTGTGAATTAG 

SEQ ID NO 39: gi_6729511_ref_CAB67667 . 1, 2xC2H2, Arabidopsis 

S^pJdyeesmkhidiveslmmlsrsfvvkqidvkqstgsktnhnnhfecktcnrkfdsfq 

ALGGHRASHKKPKLIVDQEQVKHRNKENDMHKCTICDQMFGTGQALGGHMRKHRTSMITEQS 
IVPSWYSRPVFNRCSSSKEILDLNLTPLENDLVLIFGKNLVPQIDLKFVN 

SEQ ID NO 40: ref CA279020, 2xC2H2, sugar cane 

CCTAACCAGCATTAGCTTTTCAAATCAACAAGCCTCGCCGTGACCGATCGATGGCCATCACC 

CACGACGACTACGTCTCCCTCTGCCTCATGGCGCTCGCAGCCGCGGGAGGCGGAGGCCAAGC 

TGGTTTAACAACGCAGTACGCTCTGAACACGGCTGCCTGGACAGCGACGGCGCAAGAGTCCG 

AGCTCCGCTTCCGGTGCTCCGTCTGTGGCAAGGCCTTCGCGTCGCACCAGGCACTGGGCGGG 

CACAAGGCCAGCCACCGCAAGCCGACGCTCGTACAGGCACATGCGTCGTCCTCAGCCGGAGG 

CGCGGCGTCGTCGTCGGTAACAATGACCTCGGCCGTAGGCAGCAGTGGGCAGGGGAGGCACA 

GGTGCACGGTGTGCCATCGGAGCTTCGCGACGGNGCAAGCGCTCGGCGGGCACAAGAGGTGC 

CATTACTGGGACGGGCTCTCGGTGTCGCTCACCGCGTCGTCGGCGCCATCGGGGTCCGGGTC 

GACCGTCAAGGGCTTTGATCTGAATTTGGTGCCGGTGCCGCCCGCGATGGCCGCCAACGCTG 

CGACAAGGTGGGGAGAGGAGAANNAAGTCANAAACCCTTGGCGGTCAAGAGAAGGCGGCTTG 

CCGGTCCGTCTTGGACCCTAATTTAACGATTTAGAAGTCCTTTTTTTAATAATTAAGAGTTC 

TTTTGAAGAAGGTTGTAAAGTTTTCGAACCTTGTTCTTTTAATGGATTTGGGTGCTGGCGAA 

ATTTTAAAACTGGATTTAAATTTGCGCTCACTCTTTTTTTTTATTTTTTACACCCTTTTTTT 

TTTTTAGAAGAAGA 
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SEQ ID NO 41: gi_18027011_ref_AF254447.1, 2xC2H2, Arabidopsis 
thaliana 

TTCCTTTCTCTTCCTCTCTCTCTCTCTTCACCATGACTGATCCTTATTCCAATTTCTTCACA 

GACTGGTTCAAGTCTAATCCTTTTCACCATTACCCTAATTCCTCCACTAACCCCTCTCCTCA 

TCCTCTTCCTCCTGTTACTCCTCCCTCTTCCTTCTTCTTCTTCCCTCAATCCGGAGACCTCC 

GCCGTCCACCGCCGCCACCAACTCCTCCTCCTTCTCCTCCTCTCCGAGAAGCCCTCCCTCTC 

CTCAGCCTCAGCCCCGCCAACAAACAACAAGACCACCATCACAACCATGACCACCTTATTCA 

AGAACGACCTTCAACCTCCATGGATGTCGACTACGATCATCACCATCAAGATGATCATCATA 

ACCTCGATGACGATGACCATGACGTCACCGTTGCTCTTCACATAGGCCTTCCAAGCCCTAGT 

GCTCAAGAGATGGCCTCTTTGCTCATGATGTCTTCTTCTTCCTCTTCCTCGAGGACCACTCA 

TCATCACGAGGACATGAATCACAAGAAAGACCTCGACCATGAGTACAGCCACGGAGCTGTCG 

GAGGAGGAGAAGATGACGATGAAGATTCAGTCGGCGGAGACGGCGGCTGTAGAATCAGCAGA 

CTCAACAAGGGTCAATATTGGATCCCTACACCTTCTCAGATTCTCATTGGCCCTACTCAGTT 

CTCATGTCCTGTTTGCTTCAAAACCTTCAACAGATACAATAACATGCAGATGCATATGTGGG 

GACATGGATCACAATACAGAAAAGGACCTGAATCTCTAAGGGGAACACAACCAACAGGAATG 

CTAAGGCTTCCGTGCTATTGCTGCGCCCCAGGCTGTCGCAACAACATTGACCATCCAAGGGC 

AAAGCCTCTCAAAGACTTCAGAACCCTTCAAACACATTACAAGAGAAAACATGGGATCAAAC 

CTTTCATGTGTAGGAAATGTGGAAAGGCTTTCGCAGTCCGAGGGGACTGGAGAACACATGAG 

AAGAATTGTGGCAAACTTTGGTATTGCATATGTGGATCTGATTTCAAGCACAAGAGATCTCT 

CAAAGATCACATCAAGGCTTTTGGGAATGGTCATGGAGCCTACGGAATTGATGGGTTTGATG 

AAGAAGATGAGCCTGCCTCTGAGGTAGAACAATTAGACAATGATCATGAGTCAATGCAGTCT 

AAATAGCTTATATATATTACTATAAGTACTAAGTAATTCGGTATATATATTAATTATAAGAA 

ACCTAAATCTATGGACCAAGTTTTGATGGAGGTAGGGCTTTTCAAACTAAAAGCTATATCAT 

CTAATTGATCATAGGAAAAAAATGAATCAAGAGCACTTGGAAAATTTTAAATTGTATCTTTA 

GCTTCCTAGTTAAATTTATTGCAAGACAATGTAGCAGTCTAACCAATGAGGTTCCCAACGGT 

TTATTTCTATTTGTATATTATTTTGTCATTAGCTTCACCTTTCGTTAATTCGAAGGACATAA 

CTTATAAATGTTTAAATTATG 

SEQ ID NO 42: At3g57670, 2xC2H2, Arabidopsis thaliana 

MTDPYSNFFTDWFKSNPFHHYPNSSTNPSPHPLPPVTPPSSFFFFPQSGDLRRPPPPPTPPP 

SPPLREALPLLSLSPANKQQDHHHNHDHLIQEPPSTSMDVDYDHHHQDDHHNLDDDDHDVTV 

ALHIGLPSPSAQEMASLLMMSSSSSSSRTTHHHEDMNHKKDLDHEYSHGAVGGGEDDDEDSV 

GGDGGCRISRLNKGQYWIPTPSQILIGPTQFSCPVCFKTFNRYNNMQMHMWGHGSQYRKGPE 

SLRGTQPTGMLRLPCYCCAPGCRNNIDHPRAKPLKDFRTLQTHYKRKHGIKPFMCRKCGKAF 

AVRGDWRTHEKNCGKLWYC ICGS DFKHKRSLKDHIKAFGNGHGAYGIDGFDEEDEPASEVEQ 

LDNDHESMQSK 

SEQ ID NO 43: gi_18676370_ref_AJ311810 .2, 2xC2H2, Arabidopsis 
thaliana 

ATCTACACACTACTACTCACATCTCATCTCTCTCTAGCACATACCCATCAAACCATATAGAT 
ACGGTGCTTTTATTCTTGATCTTCTTCTTCTTCTTTGTCTTCTCCTCAGAGTCATGTCTAAT 
CCAGCTTGTTCGAATCTCTTCAACAATGGATGTGACCATAATAGCTTCAACTATTCCACTTC 
TCTCTCTTACATTTACAACTCTCACGGTAGCTACTATTACTCTAATACCACAAACCCTAATT 
ACATTAATCATACTCATACCACTTCCACTTCCCCTAACTCACCCCCACTAAGAGAAGCTCTT 
CCTCTTCTTAGCTTAAGCCCCATAAGGCACCAAGAACAACAAGACCAACACTATTTCATGGA 
CACCCATCAAATTAGCTCTTCAAACTTTCTTGATGATCCTCTTGTGACTGTGGATCTTCATC 
TAGGGTTACCAAACTACGGTGTTGGTGAGAGCATTAGGAGCAATATTGCTCCTGATGCAACC 
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ACGGACGAGCAAGATCAAGATCATGACCGAGGAGTAGAAGTCACAGTTGAGTCCCACCTTGA 
TGATGATGATGATCATCATGGAGATCTACACAGAGGTCATCACTATTGGATTCCTACTCCTT 
CTCAGATTTTGATTGGTCCTACACAGTTCACTTGTCCTCTTTGCTTCAAGACATTCAACAGA 
TACAACAACATGCAGATGCACATGTGGGGACACGGCTCACAATACAGAAAGGGACCAGAATC 
CTTAAGAGGAACCCAACCAACAGGAATGCTAAGACTACCATGTTTCTGCTGTGCACCCGGTT 
GCAAGAACAACATTGACCACCCACGAGCCAAGCCTCTTAAGGACTTTCGAACCCTCCAAACA 
CATTACAAACGTAAACATGGGTCTAAACCATTTGCTTGTCGTATGTGTGGTAAGGCCTTTGC 
AGTGAAAGGAGATTGGAGAACGCATGAGAAGAATTGTGGAAAGCTTTGGTATTGCTCTTGTG 
GCTCGGATTTTAAGCACAAGAGGTCGCTTAAGGACCATGTCAAGGCCTTTGGAAATGGTCAT 
GTTCCTTGTGGGATTGATAGTTTTGGAGGAGATCATGAGGACTACTATGATGCTGCTTCTGA 
TATCGAGCAATAAGATGATAGCAACAACAATGAGTGTTAATTAGGGGTTTTGTTTATTTTTC 
CTCTCATGCATTAGTTGATTGTATGCACGTGTTCTTTAGTTTTGTTCTTCGGATCTTTGTTT 
TATTTTGTTTTGAGCTGTTTTTTTTTTAATTACTAAGAAGTTAATTATCATCTAAAGATTTT 

C 

SEQ ID NO 44: gi_18376498_ref_CAC86167 .1, 2xC2H2, Arabidopsis 
thaliana 

MSNPACSNLFNNGCDHNSFNYSTSLSYIYNSHGSYYYSNTTNPNYINHTHTTSTSPNSPPLR 
EALPLLSLSPIRHQEQQDQHYFMDTHQISSSNFLDDPLVTVDLHLGLPNYGVGESIRSNIAP 
DATTDEQDQDHDRGVEVTVESHLDDDDDHHGDLHRGHHYWIPTPSQILIGPTQFTCPLCFKT 
FNRYNNMQMHMWGHGSQYPJCGPESLRGTQPTGMLRLPCFCCAPGCKNNIDHPRAKPLKDFRT 
LQTHYKRKHGSKPFACRMCGKAFAVKGDWRTHEKNCGBCLWYCSCGSDFKHKRSLKDHVKAFG 

NGHVPCGI DS FGGDHEDYY DAAS DIEQ 

SEQ ID NO 45: gi_7798991_ref_AL355775. l_region 7957 ... 8451, 

2xC2H2, Arabidopsis thaliana „„„,„,„„,„, 
ATGGTTGCGAGAAGTGAGGAAGTTGAGATAGTGGAAGATACGGCGGCGAAATGTTTGATGTT 

GTTATCAAGAGTTGGAGAATGCGGCGGAGGAGGAGAGAAACGAGTTTTCCGATGCAAGACTT 

GTCTTAAAGAGTTTTCGTCGTTTCAAGCTTTGGGAGGTCATCGTGCAAGCCACAAGAAACTC 

ATTAACAGTAGCGATCCATCACTTCTTGGATCCTTGTCTAACAAGAAAACTAAAACGGCGAC 

GTCTCATCCTTGTCCGATATGTGGCGTGGAGTTTCCGATGGGGCAAGCTCTTGGTGGTCACA 

TGAGGAGACATAGGAGTGAGAAAGCCTCACCAGGCACGTTGGTTACACGTTCTTTTTTACCG 

GAGACGACGACGGTGACGACTTTGAAAAAATCGAGTAGTGGGAAGAGAGTGGCTTGTTTGGA 

CTTAGATTCGATGGAGAGTTTAGTCAATTGGAAGTTGGAGTTGGGAAGAACGATTTCTTGA 

SEQ ID NO 46: gi_7798996_ref_CAB90935. 1, 2xC2H2, Arabidopsis 

Sarsee^ivedtaakclmllsrvgecggggekrvfrcktclkefssfqalgghrashkkl 

INSSDPSLLGSLSNKKTKTATSHPCPICGVEFPMGQALGGHMRRHRSEKASPGTLVTRSFLP 
ETTTVTTLKKSSSGKRVACLDLDSMESLVNWKLELGRTIS 

SEQ ID NO 47: gi_9755794_ref_AL391143. 1 _region 31730 ... 
32 938, 2xC2H2, Arabidopsis thaliana 

ATGGAAGACGAACATCAAGATCTCCATAAACCCATTAATGGAGCTTTGCGAGACCTCAAGAT 
TACTCGGTCACAGAAAGAAACAGAAAAGTCTACGAACCAACAGCAAGATGTTACTTGTTACT 
ATGGTCTAAGGGAAAACTCGAAGAAGAAAACCCAGGAATCTCCGGAACCAATGAAGAAGATT 
TTGTTTCGATGCGAAGAATGTGGAAAAGGGTTTCGGTACGAGAAATATTTTAAGAATCATCG 
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CTCGATGATGCATTTATCGCCGAACGAGAAGGTTTGTGAAGAATCCTTGATGACTCTGTCTC 
GTAGCCTTGGGTTTGTGAAGAAGAAGAAAAGATCAAGACTTGGTAGGTCTGGGAAGACTTTA 
TTTACTACGTTTCTTGAACCGAGTTCTATTTTTGATGCGACTGATGAAGAATTAGAAGTGGC 
GGATTGTTTGATTCTATTGTCTAAGAGTGCTCCCAAGGTTGTAGACGAATTGAAAAGTCTTT 
CTGAGGCAGTACGTGTTACTCCTGAAACACCTGAAAGTAGCTATGATTTGGGTTGTTTGCTC 
AACAAGAAACCGAGAAAAGGTGGTGAATTGGAATCTGGGGTTTTAAGTAATGAGCAAAGACT 
TATGGAAGAAGGGTTTAGTAGTTATGGAACATCGAAAGAACCAGCTAGCTTCTTGAGAGACG 
AAAACAGATTGGATCAGCAGAAACGGAGAAAAGATGGTGAATTTGAATCCGGACTTTTGAGT 
AATGAGCAAAGACTGCTAGAAGAAGAGATTACTACTCCTGTGACATTCAAAGGTCCAGCGAG 
TTCCTTGAGACACAAGTGTGCTTTGGATCGAAATGGAGGTGAATTTGGTCCTGAGTTTTTGA 
GTAATGAGCAAACACTGATGGAAGAAACATGGAAAGAACCAGTGAGTTTCTTAGAAGATAAG 
CATGAATTTGATCAGCGGAAAATGCGAGAAGCTGGCGACTTTGAATCTAGGTTTTACAGAAT 
TGAGCTTGGAGTAGGAGCTATGGAGTGTACTTCTTCAGATACTGATATGCTCACGCAATCTG 
ATAAGAAGAACGTTGAGCATCGATGCAGGTTGTGCAACAAGATATTCTCGTCTTATCAAGCT 
CTAGGGGGTCATCAGACGTTTCATCGGATGAGCAAATGTAAGAACAAGAAGAATGGCATAGA 

GGAATCAGTTGAACCCAGGATGACTCTGTGA 

SEQ ID KO 48: gi_9755803_ref_CAC01747 . 1, 2xC2H2, Arabidopsis 

^DEHQDLHKPINGALRDI^ITRSQKETEKSTNQQQDVTCYYGLRENSKKKTQESPEPMKKI 
LFRCEECGKGFRYEKYFKNHRSMMHLSPNEKVCEESLMTLSRSLGFVKKKKRSRLGRSGKTL 

FTTFLEPS S I FDATDEELEVADCLI LLSKSAPKWDELKS LSEAVRVT PETPESS YDLGCLL 
NKKPRKGGELESGVLSNEQRLMEEGFSSYGTSKEPASFLRDENRLDQQKRRKDGEFESGLLS 
NEQRLLEEEITTPVTFKGPASSLRHKCALDRNGGEFGPEFLSNEQTLMEETWKEPVSFLEDK 
HE FDQRKMRE AGDFES RF YRI ELGVGAMECT S S DT DMLT QS DKKNVEHRCRLCNKI FS S YQA 
LGGHQTFHRMSKCKNKKNGIEESVEPRMTL 

SEQ ID NO 49: gi_1418338_ref_X98678 . 1, 2xC2H2, Arabidopsis 

CTTGTTAGTTCACTCCACATAATAAACACCAAAGATTTCATTCTCTTCTCCATAATTTCGAA 

GTTTCTTGAATTGGGTTTGTTTCTTGATTTGTTTCTTGAATTGGGTTTTGGTCTTCTTTTCT 

TACTATATTTGGATATGATGATGGGTCAAGATGAGGTTGGGAGTGATCAGACGCAAATCATA 

AAAGGGAAACGTACGAAGCGACAAAGATCGTCTTCGACGTTTGTGGTGACGGCGGCGACAAC 

AGTGACTTCAACAAGTTCATCGGCCGGTGGAAGTGGAGGAGAAAGAGCTGTTTCAGATGAAT 

ACAACTCGGCGGTTTCGTCTCCGGTGACTACTGATTGTACGCAAGAAGAAGAAGACATGGCG 

ATTTGTCTCATCATGTTAGCTCGTGGGACAGTTCTTCCATCGCCGGATCTCAAGAACTCGAG 

AAAAATTCATCAGAAGATTTCGTCGGAGAATTCTAGTTTCTATGTGTACGAGTGTAAAACGT 

GTAACCGGACGTTTTCGTCGTTCCAAGCACTTGGTGGACACAGAGCGAGCCACAAGAAGCCG 

AGGACGTCGACTGAGGAAAAGACTAGACTACCCCTGACGCAACCCAAGTCTAGTGCATCAGA 

AGAAGGGCAAAACAGTCATTTCAAAGTTTCCGGCTCAGCCCTAGCTTCACAGGCAAGTAACA 

TCATCAACAAGGCAAACAAAGTACACGAGTGTTCCATCTGCGGTTCTGAGTTCACTTCCGGG 

CAAGCTCTCGGTGGTCACATGAGGCGGCACAGGACAGCCGTAACCACGATTAGCCCCGTTGC 

AGCCACCGCAGAAGTAAGCAGAAACAGTACAGAGGAAGAGATTGAGATCAATATAGGCCGTT 

CGATGGAACAGCAGAGGAAATATCTACCGTTGGATCTTAATCTACCAGCACCAGGAGATGAT 

CTAAGAGAGTCCAAGTTTCAAGGGATAGTATTCTCAGCAACACCAGCGTTAATAGATTGTCA 

TTACTAGTTGTTTTTTTTACTACATAATATGATGAAATATTTGTGAATTCTTCTTACTTACT 

ACTATATTGTTGATCAAAAAAAAAAAAAAAAAA 
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SEQ ID NO SO: gi_1418339_ref_CAA67236.1, 2xC2H2, Arabidopsis 

MGQDEVGSDQTQIIKGKRTKRQRSSSTFWTAATTVTSTSSSAGGSGGERAVSDEYNSAVSS 
PWTDCTQEEEDMAICLIMLARGTVLPSPDLKNSRKIHQKISSENSSFYVYECKTCNRTFSS 
FQALGGHRASHKKPRTSTEEKTRLPLTQPKSSASEEGQNSHFKVSGSALASQASNIINKANK 
VHECSICGSEFTSGQALGGHMRRHRTAVTTISPVAATAEVSRNSTEEEIEINIGRSMEQQRK 

YLPLDLNLPAPGDDLRESKFQGIVFSATPALIDCHY 
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SEQUENCE LISTING 

<110> CropDesign N.V. 

<120> Plants having modified growth characteristics and a method for 
making the same 

<130> CD-070-PCT 

<160> 50 

<170> Patentln version 3,1 

<210> 1 

<211> 692 

<212> DNA 

<213> Arabidopsis thaliana 

<400> 1 C(\ 

aatggcgctc gaggctctta catcaccaag attagcttct ccgattcctc ctttgttcga 60 

agattcttca gtcttccatg gagtcgagca ctggacaaag ggtaagcgat ctaagagatc 120 

aagatccgat ttccaccacc aaaacctcac tgaggaagag tatctagctt tttgcctcat 180 

gcttctcgct cgcgacaacc gtcagcctcc tcctcctccg gcggtggaga agttgagcta 240 

caagtgtagc gtctgcgaca agacgttctc ttcttaccaa gctctcggtg gtcacaaggc 300 

aagccaccgt aagaacttat cacagactct ctccggcgga ggagatgatc attcaacctc 360 

gtcggcgaca accacatccg ccgtgactac tggaagtggg aaatcacacg tttgcaccat 420 

ctgtaacaag tcttttcctt ccggtcaagc tctcggcgga cacaagcggt gccactacga 480 

aggaaacaac aacatcaaca ctagtagcgt gtccaactcc gaaggtgcgg ggtccactag 540 

ccacgttagc agtagccacc gtgggtttga cctcaacatc cctccgatcc ctgaattctc 600 

gatggtcaac ggagacgacg aagtcatgag ccctatgccg gcgaagaagc ctcggtttga 660 

ctttccggtc aaacttcaac tttaaggaaa tt 692 

<210> 2 

<211> 227 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 2 

Met Ala Leu Glu Ala Leu Thr Ser Pro Arg Leu Ala Ser Pro lie Pro 
x 5 10 15 

Pro Leu Phe Glu Asp Ser Ser Val Phe His Gly Val Glu His Trp Thr 
20 25 ' 30 

Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gin Asn 
35 40 45 

Leu Thr Glu Glu Glu Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg 
50 55 60 

Asp Asn Arg Gin Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr 
65 70 75 80 

Lys Cys Ser Val Cys Asp Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly 
85 90 95 

Gly His Lys Ala Ser His Arg Lys Asn Leu Ser Gin Thr Leu Ser Gly 
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100 



105 



110 



Glv Gly Asp Asp His Ser Thr Ser Ser Ala Thr Thr Thr Ser Ala Val 
115 120 125 

Thr Thr Gly Ser Gly Lys Ser His Val Cys Thr lie Cys Asn Lys Ser 
130 135 140 

Phe Pro Ser Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu 



Gly Asn Asn Asn lie Asn Thr Ser Ser Val Ser Asn Ser Glu Gly Ala 
165 170 175 

Gly Ser Thr Ser His Val Ser Ser Ser His Arg Gly Phe Asp Leu Asn 
180 185 190 

lie Pro Pro He Pro Glu Phe Ser Met Val Asn Gly Asp Asp Glu Val 
195 200 205 

Met Ser Pro Met Pro Ala Lys Lys Pro Arg Phe Asp Phe Pro Val Lys 
210 215 220 

Leu Gin Leu 
225 

<210> 3 
<211> 50 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer PRM3204 
<400> 3 

ggggacaagt ttgtacaaaa aagcaggctt cacaatggcg ctcgaggctc 

<210> 4 

<211> 53 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer PRM3205 

<400> 4 

ggggaccact ttgtacaaga aagctgggta atttccttaa agttgaagtt tga 

<210> 5 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> QALGGH motif 

<400> 5 

Gin Ala Leu Gly Gly His 



145 



150 



155 



160 
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1 



5 



<210> 6 

<211> 6 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> NNM box 
<220> 

<221> MIS COFEATURE 

<222> <3)..(3) 

<223> Xaa can be either methionine or tryptophan 

<400> 6 

Asn Asn Xaa Gin Met His 
1 5 

<210> 7 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> EAR motif 
<220> 

<2 2 1 > MISC_FEATURE 

<222> <1).-U) 

<223> Xaa can be any hydrophobic amino acid (Ala, Cys, Phe, Gly, His, I 
le, Lys, Leu, Met, Arg, Thr, Val, Trp or Tyr) 

<220> 

<221> MI S COFEATURE 

<222> (5).. (5) 

<223> Xaa can be any hydrophobic amino acid (Ala, Cys, Phe, Gly, His, I 
le, Lys, Leu, Met, Arg, Thr, Val, Trp or Tyr) 

<220> 

<221> MISC_FEATURE 
<222> (6)., (6) 

<223> Xaa can be any amino acid or no amino acid 

<4Q0> 7 

Xaa Asp Leu Asn Xaa Xaa Pro 
1 5 

<210> 8 

<211> 7 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> B-Box 



<220> 

<221> MISC FEATURE 
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<222> (3) • . (3) 

<223> Ser can be serine or no amino acid 
<220> 

<2 2 1> MI SC — FEATURE 

<222> (6).. (6) 

<223> Xaa can be any amino acid 

<400> 8 

Lys Arg Ser Lys Arg Xaa Arg 



<210> 
<211> 
<212> 



9 

12 
PRT 



<213> Artificial sequence 
<220> 

<223> L-Box 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(2) (2) 

Xaa can be any amino acid 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
<4) (5) 

Xaa can be any amino acid 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(7) (7) 

Xaa can be any amino acid 



<220> 
<221> 
<222> 
<223> 



MI S COFEATURE 
(10) . . (11) 

Xaa can be any amino acid 



<400> 9 

Glu Xaa Glu Xaa Xaa Ala Xaa Cys Leu Xaa Xaa Leu 
15 10 

<210> 10 

<211> 1006 

<212> DNA 

<213> Datisca glomerata 



<400> 10 

ggcacgagga 

tattagatgg 

gacgacccca 

cgcttagata 

gtggccgcgt 

aagcaacgac 

cgtcttatca 

atcaatcgac 



caaattctct 
ctctagaagc 
gcttgaatta 
gccccatacc 
tgcctctgca 
ttcggctacc 
ggctttgggt 
ttcctttgcc 



ctctatcctc 
gctcaactct 
ccttgagcca 
gaggaagagt 
aatcgacggg 
aaagtcagtt 
gggcacaagg 
accacgaatt 



tgaatatctt 
ccgaccacag 
tggaccaagc 
accttgcttt 
attctcagtc 
ataagtgctc 
ccagccacag 
cagccaccgt 



tggtttgtga 
ctacgccggt 
gtaagcgttc 
ctgcctcatc 
ttccattcag 
tgtgtgcgat 
aaagctcgct 
cactaccacc 



actgagaagc 
gtttcactac 
caagcgtacg 
atgctcgctc 
attcagcctg 
aaggcctttt 
ggcggcgaag 
acagcctccg 



60 
120 
180 
240 
300 
360 
420 
480 
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gaggtggtgg caggtctcat gagtgttcta tttgccacaa atcgttcccg actggccagg 540 

ccttgggtgg tcacaagcgc tgccactacg aaggcagtat cggcggcaat agtattcacc 600 

accacaacaa taccaccaac agcggaagca acggtggcat gagcatgacc tccgaagtag 660 

gttccacaca cacagtcagc cacagtcacc gtgacttcga tctcaacatc ccggccttgc 720 

cggagtttcg gtcgaatttc ttcatatccg gggatgacga ggtcgagagt cctcatccgg 780 

ccaagaaacc ccgtatattg atgaaataaa acatttctca agatcactga accaggcttt 840 

agtttcttta taggaggaga tttaaaaaag tagtatctct ctttctttat ccgtaggata 900 

attaatatat ttcgtgtaca taaatttgta gttctttaac acactctgtt tcattttctt 960 

gctttgctca actttgtatt ggttatttca ttatgaaaat tcaatt 1006 

<210> 11 
<211> 247 
<212> PRT 

<213> Datisca glomerata 

Met ° Ala 1 Leu Glu Ala Leu Asn Ser Pro Thr Thr Ala Thr Pro Val Phe 
I 5 10 15 

His Tyr Asp Asp Pro Ser Leu Asn Tyr Leu Glu Pro Trp Thr Lys Arg 
20 25 30 

Lys Arg Ser Lys Arg Thr Arg Leu Asp Ser Pro His Thr Glu Glu Glu 
35 40 45 

Tyr Leu Ala Phe Cys Leu lie Met Leu Ala Arg Gly Arg Val Ala Ser 
50 55 60 

Ala Asn Arg Arg Asp Ser Gin Ser Ser lie Gin lie Gin Pro Glu Ala 
65 70 75 80 

Thr Thr Ser Ala Thr Lys Val Ser Tyr Lys Cys Ser Val Cys Asp Lys 
85 90 95 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg . 
100 105 HO 

Lys Leu Ala Gly Gly Glu Asp Gin Ser Thr Ser Phe Ala Thr Thr Asn 
115 120 125 

Ser Ala Thr Val Thr Thr Thr Thr Ala Ser Gly Gly Gly Gly Arg Ser 
130 135 140 

His Glu Cys Ser He Cys His Lys Ser Phe Pro Thr Gly Gin Ala Leu 
145 150 155 160 

Glv Gly His Lys Arg Cys His Tyr Glu Gly Ser He Gly Gly Asn Ser 
165 170 175 

He His His His Asn Asn Thr Thr Asn Ser Gly Ser Asn Gly Gly Met 
180 185 190 

Ser Met Thr Ser Glu Val Gly Ser Thr His Thr Val Ser His Ser His 
195 200 205 

Arg Asp Phe Asp Leu Asn He Pro Ala Leu Pro Glu Phe Arg Ser Asn 
210 215 220 
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60 
120 



780 
840 
900 
960 
996 



Phe Phe lie Ser Gly Asp Asp Glu Val Glu Ser Pro His Pro Ala Lys 
225 230 '235 240 

Lys Pro Arg lie Leu Met Lys 
245 

<210> 12 
<211> 996 
<212> DNA 
<213> Glycine max 

<400> 12 

aaaattctca ctctctctct catctcgaga tcatagtatc atattcaata tcatttcata 
ccaaacacat ggctttggaa gctctcaact caccaacaac aaccgctcca tcttttccct 
ttgacgaccc aactattcca tgggcgaaac gaaaacgttc aaagcgttct cgcgaccatc 180 
cttctgaaga agagtacctc gccctctgcc tcatcatgct cgctcgcggc ggcaccacca 240 
ccgtcaacaa ccgccacgtc agccctccgc cgctacagcc acagccacag ccgacaccag 300 
atccttccac caagctcagt tacaaatgct ccgtttgcga caagagcttc ccctcttacc 360 
aagcgctcgg tggacacaag gccagtcacc ggaaactcgc cggcgccgcc gaagaccaac 4Z0 
cccccagcac caccacttcc tccgccgccg ccaccagctc cgcctccgga ggtaaggccc 480 
atgagtgctc catttgccac aaatccttcc ccaccggaca ggcccttggc ggacacaaac 540 
gttgtcacta cgaaggtaac ggtaacggaa ataacaacaa cagtaacagc gttgtcaccg 600 
tcgcctcgga aggcgtgggc tccacccaca ctgtcagtca cggccaccac cgcgacttcg 660 
atctcaacat cccggccttt ccggattttt cgaccaaggt cggagaagac gaggttgaga 720 
gccctcaccc tgtcatgaag aagcctcgcc tcttcgtcat tcccaagatc gaaatccccc /*» 
aatttcaatg aactcgttga atttttagtt tatttttcga ctatatattt tggagaattt 
tgagagttac tataatttga ttttgtacat agtacttgga agttttgttg gaccgtaccg 
gacccagttc tctggttgag gttgtacttt cacaacagtg gcagatttgc aattcaattc 
aatttatttg tttattttaa aaaaaaaaaa aaaaaa 

<210> 13 

<211> 240 

<212> PRT 

<213> Glycine max 

<400> 13 „ 
Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Thr Thr Ala Pro Ser Phe 
X 5 10 15 

Pro Phe Asp Asp Pro Thr He Pro Trp Ala Lys Arg Lys Arg Ser Lys 
20 25 30 

Arg Ser Arg Asp His Pro Ser Glu Glu Glu Tyr Leu Ala Leu Cys Leu 
35 40 45 

He Met Leu Ala Arg Gly Gly Thr Thr Thr Val Asn Asn Arg His Val 
50 55 60 

Ser Pro Pro Pro Leu Gin Pro Gin Pro Gin Pro Thr Pro Asp Pro Ser 
65 70 75 80 

Thr Lys Leu Ser Tyr Lys Cys Ser Val Cys Asp Lys Ser Phe Pro Ser 
85 ~ 90 95 

Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Ala Gly 
100 105 HO 

Ala Ala Glu Asp Gin Pro Pro Ser Thr Thr Thr Ser Ser Ala Ala Ala 



WO 2004/058980 




T/EP2003/051104 



115 120 125 

Thr Ser Ser Ala Ser Gly Gly Lys Ala His Glu Cys Ser He Cys His 
130 135 140 

Lvs Ser Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys His 
!4 5 150 155 160 

Tvr Glu Gly Asn Gly Asn Gly Asn Asn Asn Asn Ser Asn Ser Val Val 
165 170 175 

Thr Val Ala Ser Glu Gly Val Gly Ser Thr His Thr Val Ser His Gly 
180 185 190 

His His Arg Asp Phe Asp Leu Asn He Pro Ala Phe Pro Asp Phe Ser 
195 200 205 

Thr Lys Val Gly Glu Asp Glu Val Glu Ser Pro His Pro Val Met Lys 
210 215 220 

Lvs Pro Arg Leu Phe Val He Pro Lys He Glu He Pro Gin Phe Gin 
225 230 235 240 

<210> 14 
<211> 1006 
<212> DNA 

<213> Medicago sativa 
<400> 14 

aattcggcac gagaaataac cacttctctc tcaaaacctc cttttgcctt ttgcttctac 

tttcacttgc gtaacgctaa ctaactcttc tcgagtgttc ttcttttcat catatggcta 120 

tggaagcact taactcaccc accactgcta ctcctttcac accctttgag gaaccaaatc 180 

tgagttatct tgaaacaccg tggacgaaag gtaaacgatc aaagcgttct cgcatggatc 240 

aatcttcatg cactgaagaa gagtatctcg ctctttgtct catcatgctt gctcgcagcg 300 

gtaacaacaa cgacaaaaag tctgattcgg tggcgacgcc gctaaccacc gttaaactca 360 

gtcacaaatg ctcagtctgc aacaaagctt tctcatctta tcaagcccta ggtggacaca 420 

aagccagtca ccggaaagct gttatgtccg caaccaccgc tgaagatcag atcaccacca 480 

cttcatccgc cgtgactacc agctctgctt ccaacggtaa gaacaagact catgagtgtt 540 

ccatctgtca caaatccttc cctactggac aggctttggg aggacacaag cgttgtcact 600 

acgaaggcag cgttggtgcc ggtgccggtg ctggaagtaa cgctgtaact gcctctgaag 660 

gagttggatt gtcacacagc caccaccgtg attttgatct taacctcccg gcttttccgg 720 

acttttcaaa gaagtttttc gtggatgacg aggtttttag tcctttacct gctgcaaaga 780 

agccctgtct tttcaagctg gaaattcctt ctcattactg atcaataata gatccaattt 840 

tattgttatt attattaata attattatcg cttagggcat agttattttc ttttttcttt 900 

caattatttc ggatcaattt gttctgtaca tacaaattgg gattggtttt agaatttagg 960 

acggttgtag acaatggaaa ttcaattcaa ttatttaatt ttgtgt 1006 

<210> 15 
<211> 235 
<212> PRT 

<213> Medicago sativa 

<400> 15 ^ 

Met Ala Met Glu Ala Leu Asn Ser Pro Thr Thr Ala Thr Pro Phe Thr 

15 10 15 

Pro Phe Glu Glu Pro Asn Leu Ser Tyr Leu Glu Thr Pro Trp Thr Lys 
20 25 30 



60 
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Gly Lys Arg Ser Lys Arg Ser Arg Met Asp Gin Ser Ser Cys Thr Glu 
35 4Q 45 

Glu Glu Tyr Leu Ala Leu Cys Leu lie Met Leu Ala Arg Ser Gly Asn 
50 55 60 

Asn Asn Asp Lys Lys Ser Asp Ser Val Ala Thr Pro Leu Thr Thr Val 
65 ~ 70 75 80 

Lvs Leu Ser His Lys Cys Ser Val Cys Asn Lys Ala Phe Ser Ser Tyr 
85 90 95 

Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Ala Val Met Ser 
100 105 HO 

Ala Thr Thr Ala Glu Asp Gin lie Thr Thr Thr Ser Ser Ala Val Thr 
115 120 125 

Thr Ser Ser Ala Ser Asn Gly Lys Asn Lys Thr His Glu Cys Ser He 
130 135 140 

Cvs His Lys Ser Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg 
145 150 155 160 

Cys His Tyr Glu Gly Ser Val Gly Ala Gly Ala Gly Ala Gly Ser Asn 
165 170 175 

Ala Val Thr Ala Ser Glu Gly Val Gly Leu Ser His Ser His His Arg 
180 185 190 

Asp Phe Asp Leu Asn Leu Pro Ala Phe Pro Asp Phe Ser Lys Lys Phe 
195 200 205 

Phe Val Asp Asp Glu Val Phe Ser Pro Leu Pro Ala Ala Lys Lys Pro 
210 215 220 

Cys Leu Phe Lys Leu Glu lie Pro Ser His Tyr 
225 230 235 

<210> 16 

<211> 1061 

<212> DNA 

<213> Nicotiana tabacum 

<400> 16 Cf\ 

ttttccctcg aatttgataa ctaaagagaa tattatgact cttgaagctt tgaagtcacc 60 

tacggcggca acgccgactc taccaccacg ctatgaagat gatgatgaaa ttcataattt 120 

ggattcttgg gctaaaggaa aacgatcaaa acggccccgt attgatgccc caccgactga 180 

agaagagtat ttagccctct gtctcatcat gctcgctcgc agcggaaccg gaaccagaac 240 

cggtttaact gatgctacta cttcccaaca acctgccgat aaaaaaaccg ccgagttgcc 300 

gccggttcat aagaaagagg tggcaacaga gcaagcagag caatcttaca agtgtagcgt 360 

gtgtgacaag gctttttctt cttatcaagc actcggtggg cataaagcaa gtcaccgtaa 420 

aactactact actgctaccg ccgcctctga tgataacaat ccttcaactt caacttccac 480 

tggcgccgtt aatatctctg ctcttaatcc aactggtcgt tcacacgtct gttctatttg 540 

ccacaaggct tttcctactg gccaagcttt gggtgggcac aagcgccgcc actatgaagg 600 

caaactcggt ggtaacagcc gcgacttagg cggcggcggc ggcggcggtc atagtggaag 660 

cgtcttgact acttcagacg gcggcgcgtc gactcacacg ctacgtgact ttgacctgaa 720 
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catgcctgct tcgccggaat tgcaactggg tctgagtatt gattgtggac ggaaaagtca 780 

actgttgccg atggtccaag aggtggaaag tcctatgcct gcaaagaaac cgcgtttatt 840 

gttttcgttg ggttgaaact tctttagggg aattgaattg attgtgtttt agccaaatta 900 

gtaaattggt tcatgtgatt ttatttttag gaaaaggaat tattgattgt tttacccgtt 960 

tattcttagg gtggtattat gtacagggag tgaatcattc attggtttta cactttctta 1020 

attatatatt cttttttttt acacataaaa aaaaaaaaaa a 1061 

<210> 17 

<211> 273 

<212> PRT 

<213> Nicotiana tabacum 

<400> 17 

Met Thr Leu Glu Ala Leu Lys Ser Pro Thr Ala Ala Thr Pro Thr Leu 
I 5 10 15 

Pro Pro Arg Tyr Glu Asp Asp Asp Glu He His Asn Leu Asp Ser Trp 
20 25 30 

Ala Lys Gly Lys Arg Ser Lys Arg Pro Arg He Asp Ala Pro Pro Thr 
35 40 45 

Glu Glu Glu Tyr Leu Ala Leu Cys Leu He Met Leu Ala Arg Ser Gly 
50 55 60 

Thr Gly Thr Arg Thr Gly Leu Thr Asp Ala Thr Thr Ser Gin Gin Pro 
65 " 70 75 80 

Ala Asp Lys Lys Thr Ala Glu Leu Pro Pro Val His Lys Lys Glu Val 
85 90 95 

Ala Thr Glu Gin Ala Glu Gin Ser Tyr Lys Cys Ser Val Cys Asp Lys 
100 105 HO 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg 
115 120 125 

Lys Thr Thr Thr Thr Ala Thr Ala Ala Ser Asp Asp Asn Asn Pro Ser 
130 135 140 

Thr Ser Thr Ser Thr Gly Ala Val Asn He Ser Ala Leu Asn Pro Thr 
145 150 155 160 

Glv Arg Ser His Val Cys Ser He Cys His Lys Ala Phe Pro Thr Gly 
165 170 175 

Gin Ala Leu Gly Gly His Lys Arg Arg His Tyr Glu Gly Lys Leu Gly 
180 185 190 

Gly Asn Ser Arg Asp Leu Gly Gly Gly Gly Gly Gly Gly His Ser Gly 
195 ~ 200 205 

Ser Val Leu Thr Thr Ser Asp Gly Gly Ala Ser Thr His Thr Leu Arg 
210 215 220 

Asp Phe Asp Leu Asn Met Pro Ala Ser Pro Glu Leu Gin Leu Gly Leu 
225 230 235 240 
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Ser lie Asp Cys Gly Arg Lys Ser Gin Leu Leu Pro Met Val Gin Glu 
245 250 255 

Val Glu Ser Pro Met Pro Ala Lys Lys Pro Arg Leu Leu Phe Ser Leu 
260 265 270 



<210> 18 

<211> 1213 

<212> DNA 

<213> Oryza sativa 

<400> 18 

aattcggcac gaggccacac agcaaccagc cagctgccac actagcttga ggcgagcgag 60 

cgaagcttag ctagcggata gaacaagtcg tcgatctgct tgctgctttt gtgaattgcg 120 

gtggaagcat gtcgagcgcg tcgtccatgg aagcgctcca cgccgcggtg ctcaaggagg 180 

agcagcagca gcacgaggtg gaggaggcga cggtcgtgac gagcagcagc gccacgagcg 240 

gggaggaggg cggacacctg ccccaggggt gggcgaagcg gaagcggtcg cgccgccagc 300 

gatcggagga ggagaacctc gcgctctgcc tcctcatgct cgcccgcggc ggccaccacc 360 

gcgtccaggc gccgcctccg ctctcggctt cggcgccccc gccggcaggt gcggagttca 420 

agtgctccgt ctgcggcaag tccttcagct cctaccaggc gctcggcggc cacaagacga 480 

gccaccgggt caagctgccg actccgcccg cagctcccgt cttggctccc gcccccgtcg 540 

ccgccttgct gccttccgcc gaggaccgcg agccagccac gtcatccacc gccgcgtcct 600 

ccgacggcat gaccaacaga gtccacaggt gttccatctg ccagaaggag ttccccaccg 660 

ggcaggcgct cggcgggcac aagaggaagc actacgacgg tggcgtaggc gccggcgccg 720 

gcgcatcttc aaccgagctc ctggccacgg tggccgccga gtccgaggtg ggaagctccg 780 

gcaacggcca gtccgccacc cgggcgttcg acctcaacct cccggccgtg ccggagttcg 840 

tgtggcggcc gtgctccaag ggcaagaaga tgtgggacga ggaggaggag gtccagagcc 900 

ccctcgcctt caagaagccc cggcttctca ccgcgtaatt cagcagctgc acggatccga 960 

tccgtcagag tttttgtcta gggagtgaaa ttcagtcgaa acacactatt cgttgattcg 1020 

ttttgtgccg ctattgttta atttgttcct gcttttgtac agagcaagcg agtgatacat 1080 

agccatacat acagtcatac agatataggt ctagctcttc cttggttctt tgtaacactg 1140 

gaactgtacc tgtatctttt acactttgtt ctttgacagt catatattgt agaccaaaaa 1200 
aaaaaaaaaa aaa 

<210> 19 

<211> 269 

<212> PRT 

<213> Oryza sativa 

<400> 19 

Met Ser Ser Ala Ser Ser Met Glu Ala Leu His Ala Ala Val Leu Lys 
1 5 10 15 

Glu Glu Gin Gin Gin His Glu Val Glu Glu Ala Thr Val Val Thr Ser 
20 25 30 

Ser Ser Ala Thr Ser Gly Glu Glu Gly Gly His Leu Pro Gin Gly Trp 
35 40 45 

Ala Lys Arg Lys Arg Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu 
50 55 60 

Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Gly His His Arg Val Gin 
65 A 70 75 80 



1213 
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Ala Pro Pro Pro Leu Ser Ala Ser Ala Pro Pro Pro Ala Gly Ala Glu 
85 90 95 

Phe Lys Cys Ser Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 HO 

Gly Gly His Lys Thr Ser His Arg Val Lys Leu Pro Thr Pro Pro Ala 
115 120 125 

Ala Pro Val Leu Ala Pro Ala Pro Val Ala Ala Leu Leu Pro Ser Ala 
130 135 140 

Glu Asp Arg Glu Pro Ala Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly 
14 5 150 155 160 

Met Thr Asn Arg Val His Arg Cys Ser He Cys Gin Lys Glu Phe Pro 
165 170 175 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Lys His Tyr Asp Gly Gly 
180 185 190 

Val Gly Ala Gly Ala Gly Ala Ser Ser Thr Glu Leu Leu Ala Thr Val 
195 200 205 

Ala Ala Glu Ser Glu Val Gly Ser Ser Gly Asn Gly Gin Ser Ala Thr 
210 215 220 

Ara Ala Phe Asp Leu Asn Leu Pro Ala Val Pro Glu Phe Val Trp Arg 
225 230 235 240. 

Pro Cys Ser Lys Gly Lys Lys Met Trp Asp Glu Glu Glu Glu Val Gin 
245 250 255 

Ser Pro Leu Ala Phe Lys Lys Pro Arg Leu Leu Thr Ala 
260 265 

<210> 20 
<211> 1020 
<212> DNA 

<213> Petunia x hybrida 



60 
120 



<400> 20 

ttcactcacc aaaacaactt ctctacctct tctacttgca cattcaaatt ctttcattac 
tacttatctc tactaatctt gattcgattt tagtaaatca aacaagagaa tcttttcagt 

aatacaaaca agaaaatttt ctctctatac ttgattgagt ttagtaaggc aaacaagaaa 180 

actatcatgg cacttgaagc attgaattct ccaactacaa caacaccacc atcattccaa 240 

tttgagaaca acgggcttaa gtaccttgag agttggacaa aaggtaaaag atcaaaaagg 300 

caacgcagca tggaacgaca gtgtactgaa gaagagtatt tagcactttg tcttatcatg 360 

ctagcacgta gcgatggttc tgttaataac tcacggtctc taccaccacc accactacca 420 

ccatcagttc cagtaacgtc gcaaataaac gcgacgttat tggaacagaa gaatttgtac 480 

aagtgttccg tttgtggtaa agggtttggg tcttatcaag ctttaggtgg acataaagca 540 

agtcaccgga aacttgtcag catgggagga gatgaacaat ctactacttc cactactact 600 

aacgtaacgg gaactagttc cgctaacgtt aacggtaacg gaagaactca cgaatgttca 660 

atttgtcaca agtgctttcc tactggacaa gctttaggtg gtcataaaag gtgccactat 720 

gacggtggta acggtaacgg taacggaagt gtaagtgttg gggtgacgtc atctgaaggt 780 

gtggggtcca ctattagtca tcaccgtgac tttgacttga atattcccgc gttgccggag 840 
ttttggccgg gatttggttc cggcgaggat gaggtggaga gtcctcatcc agcaaagaag 



840 
900 
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tcaaggctat ctcttccacc taaacttgaa ttattcaaag gattatagag ggaatattga 960 
tttgttacag gaagatttat taggattcac gaattttttg ttgactagtt tatgtaatat 1020 

<210> 21 

<211> 253 

<212> PRT 

<213> Petunia x hybrida 

<400> 21 „ 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Thr Thr Thr Pro Pro Ser 

X 5 10 15 

Phe Gin Phe Glu Asn Asn Gly Leu Lys Tyr Leu Glu Ser Trp Thr Lys 
20 25 30 

Gly Lys Arg Ser Lys Arg Gin Arg Ser Met Glu Arg Gin Cys Thr Glu 
35 40 45 

Glu Glu Tyr Leu Ala Leu Cys Leu lie Met Leu Ala Arg Ser Asp Gly 
50 55 60 

Ser Val Asn Asn Ser Arg Ser Leu Pro Pro Pro Pro Leu Pro Pro Ser 
65 70 75 80 

Val Pro Val Thr Ser Gin He Asn Ala Thr Leu Leu Glu Gin Lys Asn 
85 90 95 

Leu Tyr Lys Cys Ser Val Cys Gly Lys Gly Phe Gly Ser Tyr Gin Ala 
100 105 HO 

Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Val Ser Met Gly Gly 
115 120 125 

Asp Glu Gin Ser Thr Thr Ser Thr Thr Thr Asn Val Thr Gly Thr Ser 
130 135 140 

Ser Ala Asn Val Asn Gly Asn Gly Arg Thr His Glu Cys Ser He Cys 
145 150 155 160 

His Lys Cys Phe Pro Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys 
165 170 175 

His Tyr Asp Gly Gly Asn Gly Asn Gly Asn Gly Ser Val Ser Val Gly 
180 185 190 

Val Thr Ser Ser Glu Gly Val Gly Ser Thr He Ser His His Arg Asp 
195 200 205 

Phe Asp Leu Asn He Pro Ala Leu Pro Glu Phe Trp Pro Gly Phe Gly 
210 215 220 

Ser Gly Glu Asp Glu Val Glu Ser Pro His Pro Ala Lys Lys Ser Arg 
225 230 235 240 

Leu Ser Leu Pro Pro Lys Leu Glu Leu Phe Lys Gly Leu 
245 250 
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786 



<210> 22 
<211> 786 
<212> DNA 

<213> Triticum aestivum 
<400> 22 

atgtcgtcgt cggccatgga agcgctccac gccctgatcc cggagcagca ccagctggac 
gttgaggcgg ctgcggctgt cagcagcgcc accagcggcg aggagagcgg ccacgtgctg 120 
caggggtggg ccaagaggaa gcgatcgcgc cgccagcgct ccgaggagga gaacctcgcg 180 
ctctgcctcc tcatgctctc gcgcggcggc aagcagcgtg ttcaggcgcc gcagccggag 240 
tcgttcgctg cgccggtgcc tgccgagttc aagtgctccg tctgcggcaa gtccttcagc 300 
tcctaccagg cgctcggagg ccacaagacg agccaccggg tgaagcagcc gtctcctccc 360 
tctgatgccg ctgctgcccc actcgtggcc ctcccggccg tcgccgccat cctgccgtcc 420 
gccgagccgg ccacgtcgtc caccgccgcg tcctccgacg gcgcgaccaa cagagtccac 480 
aggtgctcca tctgccaaaa ggagttcccg actgggcagg cgctcggcgg gcacaagagg 540 
aagcactacg acggaggcgt gggcgccgcc gcctcgtcga ccgagcttct ggccgccgcg 600 
gccgccgagt ctgaggtggg gagcaccggc aacgggagct ccgccgcccg ggccttcgac 660 
ctgaacattc cggccgtgcc ggagttcgtg tggaggccgt gcgccaaggg caagatgatg 720 
tgggaggacg atgaggaggt gcagagcccc ctcgccttca agaagcctcg gcttctcacc 780 
gcttga 

<210> 23 
<211> 261 
<212> PRT 

<213> Triticum aestivum 
• <400> 23 

Met Ser Ser Ser Ala Met Glu Ala Leu His Ala Leu lie Pro Glu Gin 
15 10 15 

His Gin Leu Asp Val Glu Ala Ala Ala Ala Val Ser Ser Ala Thr Ser 
20 25 30 

Gly Glu Glu Ser Gly His Val Leu Gin Gly Trp Ala Lys Arg Lys Arg 
35 40 45 

Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu Ala Leu Cys Leu Leu 
50 " 55 60 

Met Leu Ser Arg Gly Gly Lys Gin Arg Val Gin Ala Pro Gin Pro Glu 
65 " 70 75 80 

Ser Phe Ala Ala Pro Val Pro Ala Glu Phe Lys Cys Ser Val Cys Gly 
85 90 95 

Lys Ser Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Thr Ser His 
100 "* 105 HO 

Arg Val Lys Gin Pro Ser Pro Pro Ser Asp Ala Ala Ala Ala Pro Leu 
115 120 125 

Val Ala Leu Pro Ala Val Ala Ala He Leu Pro Ser Ala Glu Pro Ala 
130 135 140 

Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly Ala Thr Asn Arg Val His 
145 150 155 160 

Arg Cys Ser He Cys Gin Lys Glu Phe Pro Thr Gly Gin Ala Leu Gly 
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165 170 175 

Gly His Lys Arg Lys His Tyr Asp Gly Gly Val Gly Ala Ala Ala Ser 
180 185 190 

Ser Thr Glu Leu Leu Ala Ala Ala Ala Ala Glu Ser Glu Val Gly Ser 
195 200 205 

Thr Gly Asn Gly Ser Ser Ala Ala Arg Ala Phe Asp Leu Asn He Pro 
210 215 220 

Ala Val Pro Glu Phe Val Trp Arg Pro Cys Ala Lys Gly Lys Met Met 
225 230 235 240 

Trp Glu Asp Asp Glu Glu Val Gin Ser Pro Leu Ala Phe Lys Lys Pro 
245 250 255 

Arg Leu Leu Thr Ala 
260 

<210> 24 
<211> 1026 
<212> DNA 

<213> Capsicum annum 
<400> 24 

aaaatcttcg ctacttactt acatcttcta gaatagtcac tagaaccagt aactttatac 



60 



540 
600 
660 



aacggatatc gatatggcac ttgaagcttt gaattctcca actggtacac caactccgcc 120 

accgtttcaa tttgagagcg acggccaaca gcttcgatat atcgaaaact ggaggaaggg 180 

aaagagatct aaaaggtcac gcagcatgga gcaccagcct actgaggaag aatacttagc 240 

gctttgtttg atcatgcttg cacgtagcgg tggctccgtt aatcatcaac gatctctacc 300 

accgccggct ccggtgatga aactgcacgc gccgtcgtca tcatcggcgg cggaggagga 360 

gaaggagaag atggtgtata agtgttcggt ttgtggtaag ggatttgggt cttatcaagc 420 

tttaggtgga cacaaagcta gtcaccggaa actcgtaccc ggcggagatg atcagtcaac 480 
tacctccaca accactaacg caaccggaac aacaacctcc gttaacggca acggcaacag 
aagtggaagg actcacgagt gttcgatttg tcacaagtgt tttcccactg gacaagcttt 
aggtggacac aaaaggtgtc actacgacgg cggtatcggt aacggaaacg ctaacagtgg 

cgttagtgct agcgttggag tgacgtcatc ggagggtgtg gggtccacag tcagtcaccg 720 

ggatttcgac ttgaacattc cggcgttgcc ggaattctgg ctgggatttg gttccggcga 780 

agatgaggtg gagagtccac atccggcgaa gaaatcgcgg ttatgtttgc ctccaaaata 840 

tgaattattt caacattaat gggaatttga ttgttaggat ttactatttt ggtagacaaa 900 

attatactat gtaagtttta attttcattg tgggtgggag caaaattttt aattttttgt 960 

ctatagacct agctagttac taatagcaaa aattcaattg attgatttaa aaaaaaaaaa 1020 

1026 

aaaaaa 

<210> 25 
<211> 261 
<212> PRT 

<213> Capsicum annum 
<400> 25 

Met Ala Leu Glu Ala Leu Asn Ser Pro Thr Gly Thr Pro Thr Pro Pro 
15 10 15 

Pro Phe Gin Phe Glu Ser Asp Gly Gin Gin Leu Arg Tyr He Glu Asn 
20 25 30 

Trp Arg Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Met Glu His Gin 
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35 40 45 

Pro Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu lie Met Leu Ala Arg 
' 50 55 60 

Ser Gly Gly Ser Val Asn His Gin Arg Ser Leu Pro Pro Pro Ala Pro 
65 " 70 75 80 

Val Met Lys Leu His Ala Pro Ser Ser Ser Ser Ala Ala Glu Glu Glu 
85 90 95 

Lys Glu Lys Met Val Tyr Lys Cys Ser Val Cys Gly Lys Gly Phe Gly 
100 105 HO 

Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg Lys Leu Val 
115 120 125 

Pro Gly Gly Asp Asp Gin Ser Thr Thr Ser Thr Thr Thr Asn Ala Thr 
130 135 140 

Gly Thr Thr Thr Ser Val Asn Gly Asn Gly Asn Arg Ser Gly Arg Thr 
145 150 155 160 

His Glu Cys Ser lie Cys His Lys Cys Phe Pro Thr Gly Gin Ala Leu 
165 170 175 

Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly He Gly Asn Gly Asn 
180 185 190 

Ala Asn Ser Gly Val Ser Ala Ser Val Gly Val Thr Ser Ser Glu Gly 
195 200 205 

Val Gly Ser Thr Val Ser His Arg Asp Phe Asp Leu Asn He Pro Ala 
210 215 220 

Leu Pro Glu Phe Trp Leu Gly Phe Gly Ser Gly Glu Asp Glu Val Glu 
225 230 235 240 

Ser Pro His Pro Ala Lys Lys Ser Arg Leu Cys Leu Pro Pro Lys Tyr 
245 250 255 

Glu Leu Phe Gin His 
260 

<210> 26 
<211> 1068 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 26 

acttcactct ctaatttcct tctctctatc tctcaccata ttcgcgatta aaaactctca 60 
acttttctct caaatttctg atcctttgat ccaacagtta gaagaagatt catctgatca 120 
tggccctcga agcgatgaac actccaactt cttctttcac cagaatcgaa acgaaagaag 180 
atttgatgaa cgacgccgtt ttcattgagc cgtggcttaa acgcaaacgc tccaaacgtc 240 
agcgttctca cagcccttct tcgtcttctt cctcaccgcc tcgatctcga cccaaatccc 
agaatcaaga tcttacggaa gaagagtatc tcgctctttg tctcctcatg ctcgctaaag 
atcaaccgtc gcaaacgcga tttcatcaac agtcgcaatc gttaacgccg ccgccagaat 



300 
360 
420 



caaagaacct tccgtacaag tgtaacgtct gtgaaaaagc gtttccttcc tatcaggctt 480 
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1020 



taggcggtca caaagcaagt caccgaatca aaccaccaac cgtaatctca acaaccgccg 540 

atgattcaac agctccgacc atctccatcg tcgccggaga aaaacatccg attgctgcct 600 

ccggaaagat ccacgagtgt tcaatctgtc ataaagtgtt tccgacgggt caagctttag 660 

gcggtcacaa acgttgtcac tacgaaggca acctcggcgg cggaggagga ggaggaagca 720 

aatcaatcag tcacagtgga agcgtgtcga gcacggtatc ggaagaaagg agccaccgtg 780 

gattcatcga tctaaaccta ccggcgttac ctgaactcag ccttcatcac aatccaatcg 840 
tcgacgaaga gatcttgagt ccgttgaccg gtaaaaaacc gcttttgttg accgatcacg 
accaagtcat caagaaagaa gatttatctt taaaaatcta atactcgact attaattctt 
gtgtgatttt tttcgttaca accatagttt cattttcatt tttttagtta caaattttta 

attgttctga tttggattga atattggtat attgttaggg gttgatac 1068 

<210> 27 
<211> 273 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Met Ala Leu Glu Ala Met Asn Thr Pro Thr Ser Ser Phe Thr Arg lie 
15 10 15 

Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe He Glu Pro Trp 
20 25 30 

Leu Lys Arg Lys Arg Ser Lys Arg Gin Arg Ser His Ser Pro Ser Ser 
35 40 45 

Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser Gin Asn Gin Asp 
50 55 60 

Leu Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys • 
65 70 75' 80 

Asp Gin Pro Ser Gin Thr Arg Phe His Gin Gin Ser Gin Ser Leu Thr 
85 90 95 

Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val Cys Glu 
100 105 HO 

Lys Ala Phe Pro Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His 
115 120 125 

Arg He Lys Pro Pro Thr Val He Ser Thr Thr Ala Asp Asp Ser Thr 
130 135 140 

Ala Pro Thr He Ser He Val Ala Gly Glu Lys His Pro He Ala Ala 
145 150 155 160 

Ser Glv Lys He His Glu Cys Ser He Cys His Lys Val Phe Pro Thr 
165 170 175 

Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Asn Leu 
180 185 190 

Gly Gly Gly Gly Gly Gly Gly Ser Lys Ser He Ser His Ser Gly Ser 
195 200 205 

Val Ser Ser Thr Val Ser Glu Glu Arg Ser His Arg Gly Phe He Asp 
210 215 220 
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Leu Asn Leu Pro Ala Leu Pro Glu Leu Ser Leu His His Asn Pro He 
225 230 235 240 

Val Asp Glu Glu He Leu Ser Pro Leu Thr Gly Lys Lys Pro Leu Leu 
245 250 255 

Leu Thr Asp His Asp Gin Val He Lys Lys Glu Asp Leu Ser Leu Lys 
260 265 270 

He 

<210> 28 
<211> 976 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 28 

aaatcaaatc ttttcattta caattatctt tcttctcaat ttagaactta gtagctagtc 60 
ttcaagataa tggcacttga aactcttact tctccaagat tatcttctcc gatgccgact 120 
ctgtttcaag attcagcact agggtttcat ggaagcaaag gcaaacgatc taagcgatca 180 
agatctgaat tcgaccgtca gagtctcacg gaggatgaat atatcgcttt atgtctcatg 240 
cttcttgctc gcgacggaga tagaaaccgt gaccttgacc tgccttcttc ttcgtcttca 300 
cctcctctgc ttcctcctct tcctactccg atctacaagt gtagcgtctg tgacaaggcg 360 
ttttcgtctt accaggctct tggtggacac aaggcaagtc accggaaaag cttttcgctt 420 
actcaatctg ccggaggaga tgagctgtcg acatcgtcgg cgataaccac gtctggtata 480 
tccggtggcg ggggaggaag tgtgaagtcg cacgtttgct ctatctgtca taaatcgttc 540 
gccaccggtc aagctctcgg cggccacaaa cggtgccact acgaaggaaa gaacggaggc 600 
ggtgtgagta gtagcgtgtc gaattctgaa gatgtggggt ctacaagcca cgtcagcagt 660 
ggccaccgtg ggtttgacct caacataccg ccgataccgg aattctcgat ggtcaacgga 720 
gacgaagagg tgatgagtcc tatgccggcg aagaaactcc ggtttgactt cccggagaaa 780 
ccctaaacat aaacctagga aaaactttac agaattcatt ttataggaaa ttgttttact 
gtatatacaa atatcgattt tgattgatgt tcttcttcac tgaaaaatta tgattctttg 
ttgtataatt gatgtttctg aaaaagatat aactttttat tgtttcacac gtatcaaaat 
ttgcttggat acatca 976 

<210> 29 
<211> 238 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 29 

Met Ala Leu Glu Thr Leu Thr Ser Pro Arg Leu Ser Ser Pro Met Pro 
15 10 15 

Thr Leu Phe Gin Asp Ser Ala Leu Gly Phe His Gly Ser Lys Gly Lys 
20 25 30 

Arg Ser Lys Arg Ser Arg Ser Glu Phe Asp Arg Gin Ser Leu Thr Glu 
35 40 45 

Asp Glu Tyr He Ala Leu Cys Leu Met Leu Leu Ala Arg Asp Gly Asp 
50 55 60 

Arg Asn Arg Asp Leu Asp Leu Pro Ser Ser Ser Ser Ser Pro Pro Leu 
65 70 75 80 



840 
900 
960 
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Leu Pro Pro Leu Pro Thr Pro lie Tyr Lys Cys Ser Val Cys Asp Lys 
85 90 95 

Ala Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser His Arg 
100 105 HO 

Lys Ser Phe Ser Leu Thr Gin Ser Ala Gly Gly Asp Glu Leu Ser Thr 
115 120 125 

Ser Ser Ala lie Thr Thr Ser Gly He Ser Gly Gly Gly Gly Gly Ser 
130 135 140 

Val Lys Ser His Val Cys Ser He Cys His Lys Ser Phe Ala Thr Gly 
145 150 155 160 

Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Lys Asn Gly 
165 170 175 

Gly Gly Val Ser Ser Ser Val Ser Asn Ser Glu Asp Val Gly Ser Thr 
180 185 190 

Ser His Val Ser Ser Gly His Arg Gly Phe Asp Leu Asn He Pro Pro 
195 200 205 

He Pro Glu Phe Ser Met Val Asn Gly Asp Glu Glu Val Met Ser Pro 
210 215 220 

Met Pro Ala Lys Lys Leu Arg Phe Asp Phe Pro Glu Lys Pro 
225 230 235 

<210> 30 

<211> 718 

<212> DNA 

<213> Arabidopsis thaliana 



60 
120 
180 



<400> 30 ... 
atggctctcg acactctcaa ttctcccacc tccaccacca caaccaccgc tcctcctcct 

ttcctccgtt gcctcgacga aaccgagccc gaaaacctcg aatcatggac caaaagaaaa 

cgtacaaaac gtcaccgtat agatcaacca aaccctcctc cttctgaaga agagtatctc 

gctctttgcc tccttatgct cgctcgtggc tcctccgatc atcactctcc accgtcggat 240 

catcactctc tttctccact gtccgatcat cagaaagatt acaagtgttc cgtctgtggc 300 

aaatctttcc cgtcttacca agcgttaggt ggacacaaaa caagtcaccg gaaaccggtt 360 

agtgtcgatg ttaataatag taacggaacc gttactaata acggaaatat tagtaacggt 420 

ttagttggtc aaagtgggaa gactcataac tgctctatat gttttaagtc gtttccctct 480 

ggtcaagcat tgggtggtca caaacgttgt cactatgatg gtggtaacgg taacagtaac 540 

ggtgacaata gccacaagtt tgacctaaat ttaccggctg atcaagttag tgatgagaca 600 

attggaaaaa gtcaactctc cggtgaagaa acaaagtcgg tgttgtgatt attattattt 660 

tttaccgatc gggattagct agtggttgat cattagctga gtctgtaatg aaaatgat 718 

<210> 31 
<211> 215 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 31 

Met Ala Leu Asp Thr Leu Asn Ser Pro Thr Ser Thr Thr Thr Thr Thr 
15 10 15 
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Ala Pro Pro Pro Phe Leu Arg Cys Leu Asp Glu Thr Glu Pro Glu Asn 
20 25 30 

Leu Glu Ser Trp Thr Lys Arg Lys Arg Thr Lys Arg His Arg He Asp 
35 40 45 

Gin Pro Asn Pro Pro Pro Ser Glu Glu Glu Tyr Leu Ala Leu Cys Leu 
50 55 60 

Leu Met Leu Ala Arg Gly Ser Ser Asp His His Ser Pro Pro Ser Asp 
65 70 75 80 

His His Ser Leu Ser Pro Leu Ser Asp His Gin Lys Asp Tyr Lys Cys 
85 90 95 

Ser Val Cys Gly Lys Ser Phe Pro Ser Tyr Gin Ala Leu Gly Gly His 
100 105 HO 

Lys Thr Ser His Arg Lys Pro Val Ser Val Asp Val Asn Asn Ser Asn 
115 120 125 

Gly Thr Val Thr Asn Asn Gly Asn He Ser Asn Gly Leu Val Gly Gin 
130 135 140 

Ser Gly Lys Thr His Asn Cys Ser He Cys Phe Lys Ser Phe Pro Ser 
145 150 155 160 

Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly Asn 
165 170 175 

Gly Asn Ser Asn Gly Asp Asn Ser His Lys Phe Asp Leu Asn Leu Pro 
180 185 190 

Ala Asp Gin Val Ser Asp Glu Thr He Gly Lys Ser Gin Leu Ser Gly 
195 200 205 

Glu Glu Thr Lys Ser Val Leu 
210 215 

<210> 32 
<211> 702 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 32 

aaattttcta tagcaatggc gcttgaagct cttaattcac caagattggt cgaggatccc 60 
ttaagattca atggcgttga gcagtggacc aaatgtaaga aacgatccaa acgttcgaga 120 
tctgatcttc atcataacca ccgtctcact gaggaagagt atctagcttt ctgtctcatg 180 
cttcttgctc gggatggcgg cgatcttgac tctgtgacgg ttgcggagaa gccgagttat 240 
aagtgtggcg tttgttacaa gacgttttcg tcttaccaag ctctcggcgg tcataaagcg 300 
agccaccgga gcttatacgg tggtggagag aatgataaat cgacaccatc caccgccgtg 360 
aaatctcacg tttgttcggt ttgcgggaaa tctttcgcca ccggtcaagc tctcggcggc 420 
cacaagcggt gccactacga tggtggcgtt tcgaactcgg aaggtgtggg gtctactagc 480 



cacgtcagca gtagtagcca ccgtggattt gaccttaata ttataccggt gcagggattt 540 

tcgccggacg acgaagtgat gagtccgatg gcgactaaga agcctcgcct gaagtaagtc 600 

tttgttgaag acctggaagt ttatcaaatg taaatatcaa atttcaattt caaggaacag 660 

ttttgttgat tctattacca atacacaata cgattcaatt cc 702 
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<210> 33 
<211> 193 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 33 

Met Ala Leu Glu Ala Leu Asn Ser Pro Arg Leu Val Glu Asp Pro Leu 
15 10 15 

Arq Phe Asn Gly Val Glu Gin Trp Thr Lys Cys Lys Lys Arg Ser Lys 
20 25 30 

Arg Ser Arg Ser Asp Leu His His Asn His Arg Leu Thr Glu Glu Glu 
35 40 45 

Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg Asp Gly Gly Asp Leu 
50 ~ 55 60 

Asp Ser Val Thr Val Ala Glu Lys Pro Ser Tyr Lys Cys Gly Val Cys 
65 70 75 80 

Tvr Lys Thr Phe Ser Ser Tyr Gin Ala Leu Gly Gly His Lys Ala Ser 
85 90 95 

His Arg Ser Leu Tyr Gly Gly Gly Glu Asn Asp Lys Ser Thr Pro Ser 
100 105 HO 

Thr Ala Val Lys Ser His Val Cys Ser Val Cys Gly Lys Ser Phe Ala 
115 120 125 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly 
130 135 140 

Val Ser Asn Ser Glu Gly Val Gly Ser Thr Ser His Val Ser Ser Ser 
145 150 155 160 

Ser His Arg Gly Phe Asp Leu Asn lie He Pro Val Gin Gly Phe Ser 
165 170 175 

Pro Asp Asp Glu Val Met Ser Pro Met Ala Thr Lys Lys Pro Arg Leu 
180 185 190 

Lys 

<210> 34 
<211> 1157 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 34 

cacacttcac tctttcttca tcttcttctt cttaaatagc tcgaaatcac atctcacaga 
attaaatctt atggctctcg agactctcaa ttctccaaca gctaccacca ccgctcggcc 120 
tcttctccgg tatcgtgaag aaatggagcc tgagaatctc gagcaatggg ctaaaagaaa 180 
acgaacaaaa cgtcaacgtt ttgatcacgg tcatcagaat caagaaacga acaagaacct 240 
tccttctgaa gaagagtatc tcgctctttg tctcctcatg ctcgctcgtg gctccgccgt 300 
acaatctcct cctcttcctc ctctaccgtc acgtgcgtca ccgtccgatc accgagatta 360 
caagtgtacg gtctgtggga agtccttttc gtcataccaa gccttaggtg gacacaagac 420 



60 
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85 



Tvr Lys Cys Thr Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 HO 

Gly Gly His Lys Thr Ser His Arg Lys Pro Thr Asn Thr Ser He Thr 
115 120 125 

Ser Gly Asn Gin Giu Leu Ser Asn Asn Ser His Ser Asn Ser Gly Ser 
130 135 140 

Val Val He Asn Val Thr Val Asn Thr Gly Asn Gly Val Ser Gin Ser 
145 150 155 160 

Gly Lys He His Thr Cys Ser He Cys Phe Lys Ser Phe Ala Ser Gly 
165 170 175 

Gin Ala Leu Gly Gly His Lys Arg Cys His Tyr Asp Gly Gly Asn Asn 
180 185 190 

Gly Asn Gly Asn Gly Ser Ser Ser Asn Ser Val Glu Leu Val Ala Gly 
195 200 205 



1020 
1080 
1140 
1157 



gagtcaccgg aaaccgacga acactagtat cacttccggt aaccaagaac tgtctaataa 480 

cagtcacagt aacagcggtt ccgttgttat taacgttacc gtgaacactg gtaacggtgt 540 

tagtcaaagc ggaaagattc acacttgctc aatctgtttc aagtcgtttg cgtctggtca 600 

agccttaggt ggacacaaac ggtgtcacta tgacggtggc aacaacggta acggtaacgg 660 

aagtagcagc aacagcgtag aactcgtcgc tggtagtgac gtcagcgatg ttgataatga 720 

gagatggtcc gaagaaagtg cgatcggtgg ccaccgtgga tttgacctaa acttaccggc 780 

tgatcaagtc tcagtgacga cttcttaacg ttgactgagt ttgaggaaaa agtcaactat 840 

caagcgaaga aagggttagt ggacggtgaa gattaacggt cgtttctttc cagttgcttc 900 

ggtttgagct tgactgggtc tgtaatgaaa atgattggag tggacttggc attattatta 960 

ttatttttaa aaagaaatgt taatttgttg ttggatttgt ttatagatag aggaaacaat 1020 
tgggatacac aaatattttt tttttttaca aagaaaataa taatgcagag atggatgatt 
ggatcgtaca cgttattata tagtggacca ttctgtaatc gtgaattatt attatttgtt 
agaaatttaa ttttcgt 

<210> 35 
<211> 245 
<212> PRT 

<213> Arabidopsis thaliana 

Met°Ala 3 Leu Glu Thr Leu Asn Ser Pro Thr Ala Thr Thr Thr Ala Arg 
15 10 15 

Pro Leu Leu Arg Tyr Arg Glu Glu Met Glu Pro Glu Asn Leu Glu Gin 
20 25 30 

Trp Ala Lys Arg Lys Arg Thr Lys Arg Gin Arg Phe Asp His Gly His 
35 ~ 40 45 

Gin Asn Gin Glu Thr Asn Lys Asn Leu Pro Ser Glu Glu Glu Tyr Leu 
50 55 60 

Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Ser Ala Val Gin Ser Pro 
65 . 70 75 80 

Pro Leu Pro Pro Leu Pro Ser Arg Ala Ser Pro Ser Asp His Arg Asp 

90 95 
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Ser Asp Val Ser Asp Val Asp Asn Glu Arg Trp Ser Glu Glu Ser Ala 
210 215 220 

lie Gly Gly His Arg Gly Phe Asp Leu Asn Leu Pro Ala Asp Gin Val 
225 230 235 240 

Ser Val Thr Thr Ser 
245 

<210> 36 

<211> 1213 

<212> DNA 

<213> Oryza sativa 



<210> 37 

<211> 269 

<212> PRT 

<213> Oryza sativa 

<400> 37 

Met Ser Ser Ala Ser Ser Met Glu Ala Leu His Ala Ala Val Leu Lys 
1 



5 10 15 



Glu Glu Gin Gin Gin His Glu Val Glu Glu Ala Thr Val Val Thr Ser 
20 25 30 

Ser Ser Ala Thr Ser Gly Glu Glu Gly Gly His Leu Pro Gin Gly Trp 
35 40 45 

Ala Lys Arg Lys Arg Ser Arg Arg Gin Arg Ser Glu Glu Glu Asn Leu 
50 55 60 

Ala Leu Cys Leu Leu Met Leu Ala Arg Gly Gly His His Arg Val Gin 
65 70 75 80 



60 
120 
180 



<400> 36 

aattcggcac gaggccacac agcaaccagc cagctgccac actagcttga ggcgagcgag 
cgaagcttag ctagcggata gaacaagtcg tcgatctgct tgctgctttt gtgaattgcg 
gtggaagcat gtcgagcgcg tcgtccatgg aagcgctcca cgccgcggtg ctcaaggagg 

agcagcagca gcacgaggtg gaggaggcga cggtcgtgac gagcagcagc gccacgagcg 240 

gggaggaggg cggacacctg ccccaggggt gggcgaagcg gaagcggtcg cgccgccagc 300 
gatcggagga ggagaacctc gcgctctgcc tcctcatgct cgcccgcggc ggccaccacc 
gcgtccaggc gccgcctccg ctctcggctt cggcgccccc gccggcaggt gcggagttca 
agtgctccgt ctgcggcaag tccttcagct cctaccaggc gctcggcggc cacaagacga 

gccaccgggt caagctgccg actccgcccg cagctcccgt cttggctccc gcccccgtcg 540 

ccgccttgct gccttccgcc gaggaccgcg agccagccac gtcatccacc gccgcgtcct 600 

ccgacggcat gaccaacaga gtccacaggt gttccatctg ccagaaggag ttccccaccg 660 

ggcaggcgct cggcgggcac aagaggaagc actacgacgg tggcgtaggc gccggcgccg 720 

gcgcatcttc aaccgagctc ctggccacgg tggccgccga gtccgaggtg ggaagctccg 780 

gcaacggcca gtccgccacc cgggcgttcg acctcaacct cccggccgtg ccggagttcg 840 

tgtggcggcc gtgctccaag ggcaagaaga tgtgggacga ggaggaggag gtccagagcc 900 

ccctcgcctt caagaagccc cggcttctca ccgcgtaatt cagcagctgc acggatccga 960 

tccgtcagag tttttgtcta gggagtgaaa ttcagtcgaa acacactatt cgttgattcg 1020 

ttttgtgccg ctattgttta atttgttcct gcttttgtac agagcaagcg agtgatacat 1080 

agccatacat acagtcatac agatataggt ctagctcttc cttggttctt tgtaacactg 1140 

gaactgtacc tgtatctttt acactttgtt ctttgacagt catatattgt agaccaaaaa 1200 

aaaaaaaaaa aaa Xit ' 3 



300 
360 
420 
480 
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Ala Pro Pro Pro Leu Ser Ala Ser Ala Pro Pro Pro Ala Gly Ala Glu 
85 90 95 

Phe Lys Cys Ser Val Cys Gly Lys Ser Phe Ser Ser Tyr Gin Ala Leu 
100 105 HO 

Gly Gly His Lys Thr Ser His Arg Val Lys Leu Pro Thr Pro Pro Ala 
115 120 125 

Ala Pro Val 'Leu Ala Pro Ala Pro Val Ala Ala Leu Leu Pro Ser Ala 
130 135 140 

Glu Asp Arg Glu Pro Ala Thr Ser Ser Thr Ala Ala Ser Ser Asp Gly 
145 " 150 155 160 

Met Thr Asn Arg Val His Arg Cys Ser He Cys Gin Lys Glu Phe Pro 
165 170 175 

Thr Gly Gin Ala Leu Gly Gly His Lys Arg Lys His Tyr Asp Gly Gly 
180 185 190 

Val Gly Ala Gly Ala Gly Ala Ser Ser Thr Glu Leu Leu Ala Thr Val 
195 200 205 

Ala Ala Glu Ser Glu Val Gly Ser Ser Gly Asn Gly Gin Ser Ala Thr 
210 215 220 

Arg Ala Phe Asp Leu Asn Leu Pro Ala Val Pro Glu Phe Val Trp Arg 
225 230 235 240 

Pro Cys Ser Lys Gly Lys Lys Met Trp Asp Glu Glu Glu Glu Val Gin 
245 250 255 

Ser Pro Leu Ala Phe Lys Lys Pro Arg Leu Leu Thr Ala 



<210> 38 
<211> 528 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 38 

atgaagagag accggtccga ttacgaagaa tccatgaagc atatagacat agtagaaagt 
ctaatgatgt tatctcgaag tttcgtggtc aaacaaatcg atgtaaagca atctaccgga 
agcaaaacga accataataa ccacttcgaa tgcaaaacgt gtaaccggaa atttgattcc 
ttccaagctc ttggaggtca tagagctagc cacaagaaac ctaagctgat cgttgaccaa 
gaacaggtga agcatcgtaa caaagagaat gatatgcata agtgtacaat ttgcgatcaa 
atgtttggga ccggtcaagc tctaggcggt cacatgagaa agcataggac gagcatgata 
accgagcaat cgattgtccc ttctgtggtt tattccagac cggtttttaa tcgttgcagt 
agcagcaagg agatcttgga cttaaatcta actccattgg aaaatgatct tgtgttaatc 
tttgggaaga atttggttcc acaaattgat ttgaagtttg tgaattag 

<210> 39 

<211> 175 

<212> PRT 

<213> Arabidopsis thaliana 



60 
120 
180 
240 
300 
360 
420 
480 
528 



260 



2 65 
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<400> 39 

Met Lys Arg Asp Arg Ser Asp Tyr Glu Glu Ser Met Lys His He Asp 
15 10 15 

He Val Glu Ser Leu Met Met Leu Ser Arg Ser Phe Val Val Lys Gin 
20 25 30 

He Asp Val Lys Gin Ser Thr Gly Ser Lys Thr Asn His Asn Asn His 
35 40 45 

Phe Glu Cys Lys Thr Cys Asn Arg Lys Phe Asp Ser Phe Gin Ala Leu 
50 55 60 

Gly Gly His Arg Ala Ser His Lys Lys Pro Lys Leu He Val Asp Gin 
65 ~ 70 75 80 

Glu Gin Val Lys His Arg Asn Lys Glu Asn Asp Met His Lys Cys Thr 
85 90 95 

He Cys Asp Gin Met Phe Gly Thr Gly Gin Ala Leu Gly Gly His Met 
100 105 HO 

Arg Lys His Arg Thr Ser Met He Thr Glu Gin Ser He Val Pro Ser 
115 120 125 

Val Val Tyr Ser Arg Pro Val Phe Asn Arg Cys Ser Ser Ser Lys Glu 
130 135 140 

He Leu Asp Leu Asn Leu Thr Pro Leu Glu Asn Asp Leu Val Leu lie 
145 150 155 160 

Phe Gly Lys Asn Leu Val Pro Gin He Asp Leu Lys Phe Val Asn 
165 170 175 

<210> 40 
<211> 820 
<212> DNA 

<213> Saccharum officinarum 
<220> 

<221> misc_feature 
<222> (406) . . (406) 
<223> n can be any nucleotide 

<220> 

<221> misc_feature 

<222> (581) (582) 

<223> n can be any nucleotide 

<220> 

<221> misc_feature 

<222> (589) . . (589) 

<223> n can be any nucleotide 

<400> 40 

cctaaccagc attagctttt caaatcaaca agcctcgccg tgaccgatcg atggccatca 60 
cccacgacga ctacgtctcc ctctgcctca tggcgctcgc agccgcggga ggcggaggcc 120 
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aagctggttt 
agtccgagct 
tgggcgggca 
cagccggagg 
aggggaggca 
ggcacaagag 
catcggggtc 
cgatggccgc 
ggtcaagaga 
tttttttaat 
ttaatggatt 
ttttttattt 



aacaacgcag 
ccgcttccgg 
caaggccagc 
cgcggcgtcg 
caggtgcacg 
gtgccattac 
cgggtcgacc 
caacgctgcg 
aggcggcttg 
aattaagagt 
tgggtgctgg 
tttacaccct 



tacgctctga 
tgctccgtct 
caccgcaagc 
tcgtcggtaa 
gtgtgccatc 
tgggacgggc 
gtcaagggct 
acaaggtggg 
ccggtccgtc 
tcttttgaag 
cgaaatttta 
tttttttttt 



acacggctgc 

gtggcaaggc 

cgacgctcgt« 

caatgacctc 

ggagcttcgc 

tctcggtgtc 

ttgatctgaa 

gagaggagaa 

ttggacccta 

aaggttgtaa 

aaactggatt 

tagaagaaga 



ctggacagcg 
cttcgcgtcg 
acaggcacat 
ggccgtaggc 
gacggngcaa 
gctcaccgcg 
tttggtgccg 
nnaagtcana 
atttaacgat 
agttttcgaa 
taaatttgcg 



acggcgcaag 
caccaggcac 
gcgtcgtcct 
agcagtgggc 
gcgctcggcg 
tcgtcggcgc 
gtgccgcccg 
aacccttggc 
ttagaagtcc 
ccttgttctt 
ctcactcttt 



<210> 41 

<211> 1509 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 41 

ttcctttctc 

cagactggtt 

ctcatcctct 

acctccgccg 

tccctctcct 

accttattca 

atgatcatca 

ttccaagccc 

cctcgaggac 

acagccacgg 

gcggctgtag 

ttctcattgg 

ataacatgca 

taaggggaac 

gtcgcaacaa 

cacattacaa 

tcgcagtccg 

tatgtggatc 

atggtcatgg 

tagaacaatt 

taagtactaa 

ttttgatgga 

aaaatgaatc 

tattgcaaga 

atattatttt 

taaattatg 



ttcctctctc 

caagtctaat 

tcctcctgtt 

tccaccgccg 

cagcctcagc 

agaaccacct 

taacctcgat 

tagtgctcaa 

cactcatcat 

agctgtcgga 

aatcagcaga 

ccctactcag 

gatgcatatg 

acaaccaaca 

cattgaccat 

gagaaaacat 

aggggactgg 

tgatttcaag 

agcctacgga 

agacaatgat 

gtaattcggt 

ggtagggctt 

aagagcactt 

caatgtagca 

gtcattagct 



tctctcttca 

ccttttcacc 

actcctccct 

ccaccaactc 

cccgccaaca 

tcaacctcca 

gacgatgacc 

gagatggcct 

cacgaggaca 

ggaggagaag 

ctcaacaagg 

ttctcatgtc 

tggggacatg 

ggaatgctaa 

ccaagggcaa 

gggatcaaac 

agaacacatg 

cacaagagat 

attgatgggt 

catgagtcaa 

atatatatta 

ttcaaactaa 

ggaaaatttt 

gtctaaccaa 

tcacctttcg 



ccatgactga 

attaccctaa 

cttccttctt 

ctcctccttc 

aacaacaaga 

tggatgtcga 

atgacgtcac 

ctttgctcat 

tgaatcacaa 

atgacgatga 

gtcaatattg 

ctgtttgctt 

gatcacaata 

ggcttccgtg 

agcctctcaa 

ctttcatgtg 

agaagaattg 

ctctcaaaga 

ttgatgaaga 

tgcagtctaa 

attataagaa 

aagctatatc 

aaattgtatc 

tgaggttccc 

ttaattcgaa 



tccttattcc 

ttcctccact 

cttcttccct 

tcctcctctc 

ccaccatcac 

ctacgatcat 

cgttgctctt 

gatgtcttct 

gaaagacctc 

agattcagtc 

gatccctaca 

caaaaccttc 

cagaaaagga 

ctattgctgc 

agacttcaga 

taggaaatgt 

tggcaaactt 

tcacatcaag 

agatgagcct 

atagcttata 

acctaaatct 

atctaattga 

tttagcttcc 

aacggtttat 

ggacataact 



aatttcttca 

aacccctctc 

caatccggag 

cgagaagccc 

aaccatgacc 

caccatcaag 

cacataggcc 

tcttcctctt 

gaccatgagt 

ggcggagacg 

ccttctcaga 

aacagataca 

cctgaatctc 

gccccaggct 

acccttcaaa 

ggaaaggctt 

tggtattgca 

gcttttggga 

gcctctgagg 

tatattacta 

atggaccaag 

tcataggaaa 

tagttaaatt 

ttctatttgt 

tataaatgtt 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
820 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1509 



<210> 42 

<211> 383 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 42 

Met Thr Asp Pro Tyr Ser Asn Phe Phe Thr Asp Trp Phe Lys Ser Asn 
15 10 15 



Pro Phe His His Tyr Pro Asn Ser Ser Thr Asn Pro Ser Pro His Pro 
20 25 30 
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Leu Pro Pro Val Thr Pro Pro Ser Ser Phe Phe Phe Phe Pro Gin Ser 
35 40 45 

Gly Asp Leu Arg Arg Pro Pro Pro Pro Pro Thr Pro Pro Pro Ser Pro 
50 55 60 

Pro Leu Arg Glu Ala Leu Pro Leu Leu Ser Leu Ser Pro Ala Asn Lys 
65 70 75 80 

Gin Gin Asp His His His Asn His Asp His Leu lie Gin Glu Pro Pro 
85 90 95 

Ser Thr Ser Met Asp Val Asp Tyr Asp His His His Gin Asp Asp His 
100 105 HO 

His Asn Leu Asp Asp Asp Asp His Asp Val Thr Val Ala Leu His lie 
115 120 125 

Gly Leu Pro Ser Pro Ser Ala Gin Glu Met Ala Ser Leu Leu Met Met 
130 135 140 

Ser Ser Ser Ser Ser Ser Ser Arg Thr Thr His His His Glu Asp Met 
145 150 155 160 

Asn His Lys Lys Asp Leu Asp His Glu Tyr Ser His Gly Ala Val Gly 
165 170 ' 175 

Gly Gly Glu Asp Asp Asp Glu Asp Ser Val Gly Gly Asp Gly Gly Cys 
180 185 190 

Arg He Ser Arg Leu Asn Lys Gly Gin Tyr Trp He Pro Thr Pro Ser 
195 200 205 

Gin He Leu He Gly Pro Thr Gin Phe Ser Cys Pro Val Cys Phe Lys 
210 215 220 

Thr Phe Asn Arg Tyr Asn Asn Met Gin Met His Met Trp Gly His Gly 
225 230 235 240 

Ser Gin Tyr Arg Lys Gly Pro Glu Ser Leu Arg Gly Thr Gin Pro Thr 
245 250 255 

Gly Met Leu Arg Leu Pro Cys Tyr Cys Cys Ala Pro Gly Cys Arg Asn 
260 * 265 270 

Asn He Asp His Pro Arg Ala Lys Pro Leu Lys Asp Phe Arg Thr Leu 
275 280 285 

Gin Thr His Tyr Lys Arg Lys His Gly He Lys Pro Phe Met Cys Arg 
290 295 300 

Lvs Cvs Gly Lys Ala Phe Ala Val Arg Gly Asp Trp Arg Thr His Glu 
305 310 315 320 

Lys Asn Cys Gly Lys Leu Trp Tyr Cys He Cys Gly Ser Asp Phe Lys 
325 330 335 

His Lys Arg Ser Leu Lys Asp His He Lys Ala Phe Gly Asn Gly His 
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340 345 350 

Gly Ala Tyr Gly He Asp Gly Phe Asp Glu Glu Asp Glu Pro Ala Ser 
355 360 365 

Glu Val Glu Gin Leu Asp Asn Asp His Glu Ser Met Gin Ser Lys 
370 375 380 

<210> 43 
<211> 1303 
<212> DNA 

<213> Arabidopsis thaliana 

<400> 43 _ A 

atctacacac tactactcac atctcatctc tctctagcac atacccatca aaccatatag 60 

atacggtgct tttattcttg atcttcttct tcttctttgt cttctcctca gagtcatgtc 120 

taatccagct tgttcgaatc tcttcaacaa tggatgtgac cataatagct tcaactattc 180 

cacttctctc tcttacattt acaactctca cggtagctac tattactcta ataccacaaa 240 

ccctaattac attaatcata ctcataccac ttccacttcc cctaactcac ccccactaag 300 

agaagctctt cctcttctta gcttaagccc cataaggcac caagaacaac aagaccaaca 360 

ctatttcatg gacacccatc aaattagctc ttcaaacttt cttgatgatc ctcttgtgac 420 

tgtggatctt catctagggt taccaaacta cggtgttggt gagagcatta ggagcaatat 480 

tgctcctgat gcaaccacgg acgagcaaga tcaagatcat gaccgaggag tagaagtcac 540 

agttgagtcc caccttgatg atgatgatga tcatcatgga gatctacaca gaggtcatca 600 

ctattggatt cctactcctt ctcagatttt gattggtcct acacagttca cttgtcctct 660 

ttgcttcaag acattcaaca gatacaacaa catgcagatg cacatgtggg gacacggctc 720 

acaatacaga aagggaccag aatccttaag aggaacccaa ccaacaggaa tgctaagact 780 

accatgtttc tgctgtgcac ccggttgcaa gaacaacatt gaccacccac gagccaagcc 840 

tcttaaggac tttcgaaccc tccaaacaca ttacaaacgt aaacatgggt ctaaaccatt 900 

tgcttgtcgt atgtgtggta aggcctttgc agtgaaagga gattggagaa cgcatgagaa 960 

gaattgtgga aagctttggt attgctcttg tggctcggat tttaagcaca agaggtcgct 1020 

taaggaccat gtcaaggcct ttggaaatgg tcatgttcct tgtgggattg atagttttgg 1080 

aggagatcat gaggactact atgatgctgc ttctgatatc gagcaataag atgatagcaa 1140 

caacaatgag tgttaattag gggttttgtt tatttttcct ctcatgcatt agttgattgt 1200 

atgcacgtgt tctttagttt tgttcttcgg atctttgttt tattttgttt tgagctgttt 12 60 

tttttttaat tactaagaag ttaattatca tctaaagatt ttc 1303 

<210> 44 

<211> 337 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 44 

Met Ser Asn Pro Ala Cys Ser Asn Leu Phe Asn Asn Gly Cys Asp His 
1 5 10 15 

Asn Ser Phe Asn Tyr Ser Thr Ser Leu Ser Tyr He Tyr Asn Ser His 
20 25 30 

Gly Ser Tyr Tyr Tyr Ser Asn Thr Thr Asn Pro Asn Tyr He Asn His 
35 40 45 

Thr His Thr Thr Ser Thr Ser Pro Asn Ser Pro Pro Leu Arg Glu Ala 
50 55 60 



Leu Pro Leu Leu Ser Leu Ser Pro lie Arg His Gin Glu Gin Gin Asp 
65 70 75 80 
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Gin His Tyr Phe Met Asp Thr His Gin lie Ser Ser Ser Asn Phe Leu 
85 90 95 

Asp Asp Pro Leu Val Thr Val Asp Leu His Leu Gly Leu Pro Asn Tyr 
100 105 110 

Gly Val Gly Glu Ser lie Arg Ser Asn lie Ala Pro Asp Ala Thr Thr 
115 120 125 

Asp Glu Gin Asp Gin Asp His Asp Arg Gly Val Glu Val Thr Val Glu 
130 135 140 

Ser His Leu Asp Asp Asp Asp Asp His His Gly Asp Leu His Arg Gly 
145 150 155 160 

His His Tyr Trp He Pro Thr Pro Ser Gin He Leu He Gly Pro Thr 
165 170 175 

Gin Phe Thr Cys Pro Leu Cys Phe Lys Thr Phe Asn Arg Tyr Asn Asn 
180 185 190 

Met Gin Met His Met Trp Gly His Gly Ser Gin Tyr Arg Lys Gly Pro 
195 200 205 

Glu Ser Leu Arg Gly Thr Gin Pro Thr Gly Met Leu Arg Leu Pro Cys 
210 215 220 

Phe Cys Cys Ala Pro Gly Cys Lys Asn Asn He Asp His Pro Arg Ala 
225 230 235 240 

Lys Pro Leu Lys Asp Phe Arg Thr Leu Gin Thr His Tyr Lys Arg Lys 
24 5 250 255 

His Gly Ser Lys Pro Phe Ala Cys Arg Met Cys Gly Lys Ala Phe Ala 
260 265 270 

Val Lys Gly Asp Trp Arg Thr His Glu Lys Asn Cys Gly Lys Leu Trp 
275 280 285 

Tyr Cys Ser Cys Gly Ser Asp Phe Lys His Lys Arg Ser Leu Lys Asp 
290 295 300 

His Val Lys Ala Phe Gly Asn Gly His Val Pro Cys Gly He Asp Ser 
305 310 315 320 

Phe Gly Gly Asp His Glu Asp Tyr Tyr Asp Ala Ala Ser Asp He Glu 
325 330 335 

Gin 



<210> 45 

<211> 495 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 45 

atggttgcga gaagtgagga agttgagata gtggaagata cggcggcgaa atgtttgatg 



60 
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ttgttatcaa gagttggaga atgcggcgga ggaggagaga aacgagtttt ccgatgcaag 120 

acttgtctta aagagttttc gtcgtttcaa gctttgggag gtcatcgtgc aagccacaag 180 

aaactcatta acagtagcga tccatcactt cttggatcct tgtctaacaa gaaaactaaa 240 

acggcgacgt ctcatccttg tccgatatgt ggcgtggagt ttccgatggg gcaagctctt 300 

ggtggtcaca tgaggagaca taggagtgag aaagcctcac caggcacgtt ggttacacgt 360 

tcttttttac cggagacgac gacggtgacg actttgaaaa aatcgagtag tgggaagaga 420 

gtggcttgtt tggacttaga ttcgatggag agtttagtca attggaagtt ggagttggga 480 

agaacgattt cttga 495 

<210> 46 

<211> 164 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 46 

Met Val Ala Arg Ser Glu Glu Val Glu He Val Glu Asp Thr Ala Ala 
1 5 10 15 

Lys Cys Leu Met Leu Leu Ser Arg Val Gly Glu Cys Gly Gly Gly Gly 
20 25 30 

Glu Lys Arg Val Phe Arg Cys Lys Thr Cys Leu Lys Glu Phe Ser Ser 
35 40 45 

Phe Gin. Ala Leu Gly Gly His Arg Ala Ser His Lys Lys Leu He Asn 
50 55 60 

Ser Ser Asp Pro Ser Leu Leu Gly Ser Leu Ser Asn Lys Lys Thr Lys 
65 * 70 75 80 

Thr Ala Thr Ser His Pro Cys Pro He Cys Gly Val Glu Phe Pro Met 
85 90 95 

Gly Gin Ala Leu Gly Gly His Met Arg Arg His Arg Ser Glu Lys Ala 
100 105 HO 

Ser Pro Gly Thr Leu Val Thr Arg Ser Phe Leu Pro Glu Thr Thr Thr 
115 120 125 

Val Thr Thr Leu Lys Lys Ser Ser Ser Gly Lys Arg Val Ala Cys Leu 
130 135 140 

Asp Leu Asp Ser Met Glu Ser Leu Val Asn Trp Lys Leu Glu Leu Gly 
145 150 155 160 

Arg Thr He Ser 



<210> 47 
<211> 1209 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 47 

atggaagacg aacatcaaga tctccataaa cccattaatg gagctttgcg agacctcaag 
attactcggt cacagaaaga aacagaaaag tctacgaacc aacagcaaga tgttacttgt 
tactatggtc taagggaaaa ctcgaagaag aaaacccagg aatctccgga accaatgaag 
aagattttgt ttcgatgcga agaatgtgga aaagggtttc ggtacgagaa atattttaag 
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aatcatcgct cgatgatgca tttatcgccg aacgagaagg tttgtgaaga atccttgatg 300 

actctgtctc gtagccttgg gtttgtgaag aagaagaaaa gatcaagact tggtaggtct 360 

gggaagactt tatttactac gtttcttgaa ccgagttcta tttttgatgc gactgatgaa 420 

gaattagaag tggcggattg tttgattcta ttgtctaaga gtgctcccaa ggttgtagac 480 

gaattgaaaa gtctttctga ggcagtacgt gttactcctg aaacacctga aagtagctat 540 

gatttgggtt gtttgctcaa caagaaaccg agaaaaggtg gtgaattgga atctggggtt 600 

ttaagtaatg agcaaagact tatggaagaa gggtttagta gttatggaac atcgaaagaa 660 

ccagctagct tcttgagaga cgaaaacaga ttggatcagc agaaacggag aaaagatggt 720 

gaatttgaat ccggactttt gagtaatgag caaagactgc tagaagaaga gattactact 780 

cctgtgacat tcaaaggtcc agcgagttcc ttgagacaca agtgtgcttt ggatcgaaat 840 

ggaggtgaat ttggtcctga gtttttgagt aatgagcaaa cactgatgga agaaacatgg 900 

aaagaaccag tgagtttctt agaagataag catgaatttg atcagcggaa aatgcgagaa 960 

gctggcgact ttgaatctag gttttacaga attgagcttg gagtaggagc tatggagtgt 1020 

acttcttcag atactgatat gctcacgcaa tctgataaga agaacgttga gcatcgatgc 1080 

aggttgtgca acaagatatt ctcgtcttat caagctctag ggggtcatca gacgtttcat 1140 

cggatgagca aatgtaagaa caagaagaat ggcatagagg aatcagttga acccaggatg ^00 
actctgtga 

<210> 48 
<211> 402 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 48 

Met Glu Asp Glu His Gin Asp Leu His Lys Pro lie Asn Gly Ala Leu 
15 10 15 

Arg Asp Leu Lys He Thr Arg Ser Gin Lys Glu Thr Glu Lys Ser Thr 
20 25 30 

Asn Gin Gin Gin Asp Val Thr Cys Tyr Tyr Gly Leu Arg Glu Asn Ser 
35 40 45 

Lys Lys Lys Thr Gin Glu Ser Pro Glu Pro Met Lys Lys He Leu Phe 
50 55 60 

Arg Cys Glu Glu Cys Gly Lys Gly Phe Arg Tyr Glu Lys Tyr Phe Lys 
65 70 75 80 

Asn His Arg Ser Met Met His Leu Ser Pro Asn Glu Lys Val Cys Glu 
85 90 95 

Glu Ser Leu Met Thr Leu Ser Arg Ser Leu Gly Phe Val Lys Lys Lys 
100 105 HO 

Lys Arg Ser Arg Leu Gly Arg Ser Gly Lys Thr Leu Phe Thr Thr Phe 
115 120 125 

Leu Glu Pro Ser Ser He Phe Asp Ala Thr Asp Glu Glu Leu Glu Val 
130 135 140 

Ala Asp Cys Leu He Leu Leu Ser Lys Ser Ala Pro Lys Val Val Asp 
145 150 155 160 

Glu Leu Lys Ser Leu Ser Glu Ala Val Arg Val Thr Pro Glu Thr Pro 
165 170 175 

Glu Ser Ser Tyr Asp Leu Gly Cys Leu Leu Asn Lys Lys Pro Arg Lys 
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180 185 190 

Gly Gly Glu Leu Glu Ser Gly Val Leu Ser Asn Glu Gin Arg Leu Met 
195 200 205 

Glu Glu Gly Phe Ser Ser Tyr Gly Thr Ser Lys Glu Pro Ala Ser Phe 
210 215 220 

Leu Arg Asp Glu Asn Arg Leu Asp Gin Gin Lys Arg Arg Lys Asp Gly 
225 230 235 240 

Glu Phe Glu Ser Gly Leu Leu Ser Asn Glu Gin Arg Leu Leu Glu Glu 
245 250 255 

Glu lie Thr Thr Pro Val Thr Phe Lys Gly Pro Ala Ser Ser Leu Arg 
260 265 270 

His Lys Cys Ala Leu Asp Arg Asn Gly Gly Glu Phe Gly Pro Glu Phe 
275 280 285 

Leu Ser Asn Glu Gin Thr Leu Met Glu Glu Thr Trp Lys Glu Pro Val 
290 295 300 

Ser Phe Leu Glu Asp Lys His Glu Phe Asp Gin Arg Lys Met Arg Glu 
305 310 315 320 

Ala Gly Asp Phe Glu Ser Arg Phe Tyr Arg He Glu Leu Gly Val Gly 
325 330 335 

Ala Met Glu Cys Thr Ser Ser Asp Thr Asp Met Leu Thr Gin Ser Asp 
340 345 350 

Lys Lys Asn Val Glu His Arg Cys Arg Leu Cys Asn Lys He Phe Ser 
355 360 365 

Ser Tyr Gin Ala Leu Gly Gly His Gin Thr Phe His Arg Met Ser Lys 
370 375 380 

Cys Lys Asn Lys Lys Asn Gly He Glu Glu Ser Val Glu Pro Arg Met 
385 390 395 400 

Thr Leu 



<210> 4 9 
<211> 1087 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 49 

cttgttagtt cactccacat aataaacacc aaagatttca ttctcttctc cataatttcg 
aagtttcttg aattgggttt gtttcttgat ttgtttcttg aattgggttt tggtcttctt 120 
ttcttactat atttggatat gatgatgggt caagatgagg ttgggagtga tcagacgcaa 180 
atcataaaag ggaaacgtac gaagcgacaa agatcgtctt cgacgtttgt ggtgacggcg 240 
gcgacaacag tgacttcaac aagttcatcg gccggtggaa gtggaggaga aagagctgtt 300 
tcagatgaat acaactcggc ggtttcgtct ccggtgacta ctgattgtac gcaagaagaa 360 
gaagacatgg cgatttgtct catcatgtta gctcgtggga cagttcttcc atcgccggat 420 
ctcaagaact cgagaaaaat tcatcagaag atttcgtcgg agaattctag tttctatgtg 480 



60 
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tacgagtgta aaacgtgtaa ccggacgttt tcgtcgttcc aagcacttgg tggacacaga 540 

gcgagccaca agaagccgag gacgtcgact gaggaaaaga ctagactacc cctgacgcaa 600 

cccaagtcta gtgcatcaga agaagggcaa aacagtcatt tcaaagtttc cggctcagcc 660 

ctagcttcac aggcaagtaa catcatcaac aaggcaaaca aagtacacga gtgttccatc 720 

tgcggttctg agttcacttc cgggcaagct ctcggtggtc acatgaggcg gcacaggaca 780 

gccgtaacca cgattagccc cgttgcagcc accgcagaag taagcagaaa cagtacagag 840 

gaagagattg agatcaatat aggccgttcg atggaacagc agaggaaata tctaccgttg 900 

gatcttaatc taccagcacc aggagatgat ctaagagagt ccaagtttca agggatagta 960 

ttctcagcaa caccagcgtt aatagattgt cattactagt tgtttttttt actacataat 1020 
atgatgaaat atttgtgaat tcttcttact tactactata ttgttgatca aaaaaaaaaa 
aaaaaaa 



<210> 50 
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 50 

Met Gly Gin Asp Glu Val Gly Ser Asp Gin Thr Gin lie lie Lys Gly 
15 10 15 

Lys Arg Thr Lys Arg Gin Arg Ser Ser Ser Thr Phe Val Val Thr Ala 
20 25 30 

Ala Thr Thr Val Thr Ser Thr Ser Ser Ser Ala Gly Gly Ser Gly Gly 
35 40 45 

Glu Arg Ala Val Ser Asp Glu Tyr Asn Ser Ala Val Ser Ser Pro Val 
50 55 60 

Thr Thr Asp Cys Thr Gin Glu Glu Glu Asp Met Ala lie Cys Leu lie 
65 70 75 80 

Met Leu Ala Arg Gly Thr Val Leu Pro Ser Pro Asp Leu Lys Asn Ser 
85 90 95 

Arg Lys lie His Gin Lys He Ser Ser Glu Asn Ser Ser Phe Tyr Val 
100 105 HO 

Tyr Glu Cys Lys Thr Cys Asn Arg Thr Phe Ser Ser Phe Gin Ala Leu 
115 120 125 

Gly Gly His Arg Ala Ser His Lys Lys Pro Arg Thr Ser Thr Glu Glu 
130 = 135 140 

Lys Thr Arg Leu Pro Leu Thr Gin Pro Lys Ser Ser Ala Ser Glu Glu 
145 150 155 160 

Gly Gin Asn Ser His Phe Lys Val Ser Gly Ser Ala Leu Ala Ser Gin 
165 170 175 

Ala Ser Asn He He Asn Lys Ala Asn Lys Val His Glu Cys Ser He 
180 185 190 

Cys Gly Ser Glu Phe Thr Ser Gly Gin Ala Leu Gly Gly His Met Arg 
195 200 205 

Arg His Arg Thr Ala Val Thr Thr He Ser Pro Val Ala Ala Thr Ala 



1080 
1087 
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210 215 

Glu Val Ser Arg Asn Ser Thr Glu 
225 230 

Arg Ser Met Glu Gin Gin Arg Lys 
245 

Pro Ala Pro Gly Asp Asp Leu Arg 
260 

Phe Ser Ala Thr Pro Ala Leu lie 
275 280 



220 

Glu Glu He Glu He Asn He Gly 
235 240 

Tyr Leu Pro Leu Asp Leu Asn Leu 
250 255 

Glu Ser Lys Phe Gin Gly He Val 
265 270 

Asp Cys His Tyr 



